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Editorial 

Peggy Johnson 

This is the first issue of volume 53 and the start of another 
exciting year for Library Resources and Technical Services 
(LRTS). Please join me in welcoming our new editorial board 
members (Allyson Carlyle, Lewis Brian Day, October Ivins, 
Edgar Jones, Randy Roeder, Carlen Ruschoff, and Sarah 
Simpson) and thanking the board members who completed 
their terms at the end of the 2008 ALA Annual Conference 
in Anaheim (Tschera Harkness Connell, Karla L. Hahn, Sara 
C. Heitshu, Judy Jeng, Bonnie MacEwan, Carolynne Myall, Pat Riva, and Diane 
Vizine-Goetz). Editorial board members help set the direction of the journal and 
serve an essential role as paper referees in the double-blind review process. The 
quality of LRTS depends on their dedication and diligence. I'm honored that the 
Association for Library Collections and Technical Services (ALCTS) has reap- 
pointed me to serve an additional four years as LRTS editor. No one will deny 
that serving as editor of a peer- reviewed journal is challenging and hard work, but 
most editors will agree with me in also saying that it is interesting, informative, and 
(most of the time) fun. I am delighted that Edward Swanson has accepted reap- 
pointment as the LRTS book review editor. Do contact him directly (eswanson® 
eswanson.org) if you are interested in reviewing titles for LRTS. 

This issue presents papers that cover the range of responsibilities that define 
the mission of ALCTS and its nearly five thousand members. Patrick L. Carr 
provides another installment in the familiar LRTS literature review series as he 
explores the themes and important works in the 2006-7 literature about serials 
librarianship. Steven A. Knowlton looks back at the history of cataloging codes 
and the often heated debates that characterized code reform in the 1950s and 
1960s. His premise is that reviewing the debates of the past can prove useful as we 
engage in another spirited conversation about reforming the current cataloging 
code. Stephen Hearn suggests an alternative approach to gathering and analyzing 
catalog data, intended to serve as one possible measure of a technical services 
unit's success in attaining its goals. Do spend some time studying the figures that 
accompany this article. They offer a new way to represent changes in headings 
over time. The final two papers in this issue are "Notes on Operations." LRTS 
publishes papers in this section with the intent to offer innovative approaches 
to challenges faced in many libraries. Marielle Veve reports on a new solution 
developed at the University of Tennessee Libraries to support name authority 
control in Extensible Markup Language (XML) for digitized collections. Rebecca 
L. Mugridge and Jeff Edmunds share insights from the Penn State Libraries' 
experience in developing processes to facilitate batchloading records into the 
online catalog. Much more than "how we did it good" stories, these papers pres- 
ent approaches that can inform practice in other libraries. 

The success of LRTS depends on the quality of the papers published, and 
these papers are written by you! Consider the issues you have been pondering, 
the challenges you have been addressing, and the future of libraries and the pro- 
fession of librarianship. Reflecting thoughtfully on these topics is the first step in 
writing a paper. Writing a paper enhances your knowledge and expertise. Why 
not write a paper and submit it to LRTS? 
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From Innovation to 
Transformation 

A Review of the 2006-7 
Serials Literature 

By Patrick L. Carr 

This paper reviews the leading trends in and contributions to the peer-reviewed 
and professional literature of serials librarianship published in 2006 and 2007. 
The review shows that a central topic in the literature is the nature and effect of 
libraries' ongoing transition from acquiring serials in print to providing access 
electronically. Propelled forward by user preferences, this transition is reflected 
in publications that reconceptualize collections and describe innovative initiatives 
and strategies for acquisition, access, and management. Throughout the literature, 
the review traces a prevailing sentiment that libraries are advancing well beyond 
the confines of print-centered models and are assuming new roles, imagining new 
possibilities, and developing new solutions. 

The literature of serials librarianship published in 2006 and 2007 reveals a field 
in rapid transition. The changes occurring range from the shifting nature of 
serial collections to evolving models, initiatives, and management strategies used 
to acquire and administer access to these collections. According to Plutchak, seri- 
als librarianship and scholarly communication as a whole are currently in a period 
of innovation in which emerging technologies are ceasing their emulation of the 
past and revealing extraordinary new possibilities. 1 Plutchak believes that this 
period will culminate in the transformation of scholarly communication so that 
technology "overturns the capabilities that were previously thought to be the pin- 
nacle, and brand new ways of doing things become possible." 2 From this perspec- 
tive, the 2006-7 serials literature might be said to offer a first, nascent glimpse of 
the landscape stretching before libraries as they pioneer their way from a period 
of innovation to one of transformation. Indeed, there is a prevailing sentiment 
in the literature that libraries have advanced well beyond the confines of print- 
centered models in their strategies for acquiring and administering serial access. 
The literature shows libraries assuming new roles, imagining new possibilities, 
and developing new solutions. 

This paper, the latest entry in LRTS' ongoing series reviewing the serials litera- 
ture, starts where Genereux's review of the 2004-5 literature left off 3 It examines 
the peer-reviewed and professional literature of serials librarianship published in 
2006 and 2007. The primary resource for identifying publications to include in the 
review was Library Literature and Information Science. In addition, citations in 
publication reference lists, postings on electronic discussion lists, and serendipi- 
tous discovery all contributed to forming the body of literature that was examined. 
Within this body of literature, the criteria for selecting publications to review was 
based on the author's judgment of which publications most fully exemplify the 
leading trends in and contributions to serials librarianship s literature. 



4 Carr 



LRTS 53(1) 



The first section of the review, "Collections and 
Concepts," takes a broad perspective, surveying the forces 
that the literature indicates are reshaping the nature of 
serials in libraries. Specifically, it reviews changes in the 
use, formats, and cost of serials and analyzes the effect of 
these changes on how serials are defined. The next section, 
"Acquisition," considers the literature's discussion of the 
evolving means through which serial access is acquired. In 
addition to assessing the current state of publisher packages, 
it gives particular attention to the effect of the open access 
(OA) movement and acquisition models that shift empha- 
sis from ownership to access. The third section, "Access," 
examines publications describing libraries' three primary 
serial access points: online catalogs, link resolvers, and 
metasearch engines. The fourth section, "Management," 
reviews the literature's discussion of how the managers of 
serial collections are responding to new challenges and 
opportunities. It focuses on how these managers can suc- 
cessfully communicate, achieve change, and improve work- 
flows and organizational structures. The final section of the 
review, "Initiatives," describes what the literature indicates 
to be the leading efforts to develop initiatives resulting in 
the enhanced acquisition, administration, evaluation, and 
archiving of serials. 

Given the far-reaching scope of the serials literature, 
this review cannot be comprehensive. Among the excluded 
topics are citation analyses, publishing costs, marketing, the 
storage and retention of print serials, institutional reposito- 
ries, and the OA movement's effect on the publishing indus- 
try and scholarly communication. In addition, this review 
is restricted to literature written in English and places an 
emphasis on publications geared toward librarians in the 
United States, Canada, and the United Kingdom. 

Collections and Concepts 

A central topic in the 2006-7 literature is the nature and 
effect of libraries' ongoing transition from acquiring seri- 
als in print to providing access electronically. This transi- 
tion is being propelled forward by user preferences and is 
manifesting itself in evolving collection formats, costs, and 
concepts of seriality. 

Use Studies 

As Johnson and Luther conclude from their interviews with 
twenty-four librarians and publishers, user preferences are 
among the primary forces reshaping serial collections. 4 Use 
studies published in 2006 and 2007 show preferences for 
e-serials among a variety of communities. A representa- 
tive study is Brady, McCord, and Galbraith's analysis of the 
2003 print and e-serial use of researchers at Washington 



State University's Owen Science and Engineering Library. 3 
Comparing the results of their analysis with a previous 
study conducted at the same site, the authors discovered 
that use of the library's serial collection in electronic formats 
increased from 71 percent of total use in 2001 to 94 percent 
of total use in 2003. The authors believe their findings show 
a "cultural shift" in user preferences. 6 Rowlands s review of 
e-serial use studies published in the professional literature 
offers further evidence for users' preferences for accessing 
serials electronically. 7 One of the author's key findings is, 
"Where implemented, electronic versions of journals have 
displaced print use dramatically and at a much faster rate 
than many anticipated." 8 

Voorbij and Ongering discuss reasons for users' prefer- 
ences for e-serials in their survey of Danish faculty con- 
ducted in 2003 and 2004. 9 The authors found that the most 
cited reasons for using e-serials over their print counter- 
parts are e-serials' enhanced functionalities (e.g., the abil- 
ity to perform full-text searches and use hyperlinks within 
articles) and increased accessibility. In their survey of the 
academic staff within the Consortium of Academic Libraries 
of Catalonia, Borrego and colleagues provide a picture of 
e-serial use as it relates to users' discipline and age. 10 Use 
was highest among researchers in biomedicine, engineer- 
ing, and the exact and natural sciences, who use e-serials 
either primarily or entirely, and lowest among researchers 
in the social sciences and humanities, who primarily use 
print serials. The authors also learned that e-serial use is 
prevalent among researchers under the age of forty, while 
most researchers over the age of fifty-one persist in access- 
ing serials in print. 

Format 

Libraries have responded to users' preferences by transition- 
ing to e-serials. Prabha documents this in an analysis of the 
formats in which members of the Association of Research 
Libraries (ARL) subscribe to a sample of 515 serials. 11 
From 2002 to 2006, ARL libraries' print subscriptions to the 
sample serials dwindled by 32 percent while electronic sub- 
scriptions grew by 34 percent. Prabha's research also shows 
that the period from 2005 to 2006 was a watershed in which, 
for the first time, electronic subscriptions to the sample 
serials surpassed print subscriptions. Hahn gives further 
evidence for the shift to e-serials in a 2005 survey assessing 
the participation of eighty-nine ARL libraries in serial pack- 
ages offered by five large publishers: Blackwell, Elsevier, 
Springer, Taylor and Francis, and Wiley. 12 Of the packages 
that respondents indicated they were participating in for 
2006, 58 percent involved the cancellation of print versions 
of the serials within the packages. This fact leads Hahn to 
conclude that libraries are swiftly moving to electronic- 
only formats for serials within publisher packages. Drawing 



53(1) LRTS 



From Innovation to Transformation 5 



on their interviews with librarians and publishers, Johnson 
and Luther predict that this trend will continue: "Although 
the pace and likely ultimate extent of the transition differs 
from institution to institution, all are moving along a con- 
tinuum from print-only to dual-media to e-only journals." 13 
In the near future, they speculate, it is possible that all but 
5 percent of many libraries' serial collections will only be 
accessible electronically. 

Redefining Serials 

Changes in the formats of serial collections have introduced 
deeper questions regarding the nature of seriality. In Soule's 
review of the evolving definitions that libraries have applied 
to serials over the past fifty years, the author comments 
that a challenge libraries will face in their future efforts to 
define a serial is the "increasing fragmentation of informa- 
tion" in a digital world. 14 Soules contemplates whether this 
fragmentation might someday manifest itself in a decision 
by publishers to abandon efforts to organize serials into 
units such as volumes and issues and instead make articles 
accessible electronically as they are ready for publication. 
Van Orsdel foresees a similar disaggregation, commenting 
that libraries are experiencing "a seeming shift of interest to 
the piece rather than the container, the article rather than 
the journal, the definition rather than the dictionary." 13 In 
Plutchak's view, the outcome of this shift is that "the serial 
as defined by the librarian is an anachronism in the digital 
age, and will not survive for long." 16 The author argues that, 
in the current period of transition, the attempt to clearly 
define a serial is futile. While acknowledging that, at pres- 
ent, the article remains prevalent, Plutchak anticipates that 
data sets and social networking tools have a revolutionary 
potential. 

Cost 

The evolving nature of serials has resulted in complex 
changes in the size and average unit cost of library collec- 
tions. An ARL report shows that, following fifteen years of 
stagnation, the number of serials purchased by member 
libraries skyrocketed by approximately 64 percent from 
2001 to 2005. 17 The report further indicates that the average 
unit cost of a subscription has decreased by approximately 
23 percent from 2000 to 2005. Explaining the factors behind 
these trends, Kyrillidou points to libraries' dual-format 
subscriptions (e.g., a print plus online subscription), which, 
according to ARL guidelines, should be counted twice. 18 
Other contributing factors cited by the author include con- 
sortial arrangements and libraries' transitions to online-only 
subscriptions, which are sometimes less costly than sub- 
scriptions in other formats. 

Libraries' expenditures further reflect the transition to 



e-serials. ARL statistics suggest that, for the period from 
1994-95 to 2004-5, member libraries' e-serials expenditures 
have ballooned by over 1,600 percent. 19 Libraries' overall 
serials expenditures have also experienced rapid increases. 
Since 1986, for example, ARL libraries' serials expenditures 
have increased by 302 percent, a rate of growth that signifi- 
cantly exceeds increases in the annual consumer price index 
over the same period. 

Rising subscription costs is one of the primary factors 
affecting these complex changes in collection sizes, average 
unit costs, and expenditures. Reviewing the costs of seri- 
als listed in three databases produced by the Institute for 
Scientific Information as well as EBSCO's Academic Search 
Premier database, Van Orsdel and Born estimate that 
academic libraries in the United States experienced 2007 
subscription cost increases of 9 percent for domestic serials 
and 7.3 percent for foreign serials. 20 The authors predict 
that 2008 subscription costs will increase by an additional 
7-9 percent. White and Creaser provide added documen- 
tation of the inflating costs of subscriptions. 21 Examining 
data that Swets Information Services provided for the 
subscription costs of eight commercial publishers and three 
university presses, the authors calculate overall price infla- 
tion of approximately 39 percent between 2000 and 2006. 
Moghaddan further contributes to the literature's discussion 
of pricing through a comparison of the 2003 subscription 
costs of serials from five commercial publishers and five 
nonprofit publishers. 22 Among the author's findings is that 
the average subscription cost of the commercial publishers' 
serials exceeded the average subscription cost of the non- 
profit publishers' serials by approximately 280 percent. 

Acquisition 

As a result of rising subscription costs, predictions regarding 
the sustainability of established acquisition models can be 
dire. Van Orsdel, for example, warns that "library budgets 
are, and will continue to be, no match for journal price infla- 
tion or for the cost of new journals as they appear." 23 The 
author suggests that a key component to overcoming this 
crisis is developments in the marketplace that foster com- 
petition and elasticity. The 2006-7 literature discusses both 
established acquisition models and their alternatives. 

Publisher Packages 

The literature shows that the bundling of serials into pub- 
lisher packages continues to be a prevalent acquisition 
model. Hahn documents this prevalence in a 2005 survey 
assessing the participation of eighty-nine ARL libraries in 
serial packages offered by five large publishers: Blackwell, 
Elsevier, Springer, Taylor and Francis, and Wiley. 24 Most 
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respondents (93 percent) subscribed to at least one of the 
publishers' packages, and, on average, respondents sub- 
scribed to packages offered by three of the publishers. The 
two most cited reasons for participation in packages were 
that "content and access offered were a good return on the 
investment" and "alternative non-bundled forms of access 
to the content were prohibitively expensive." 25 Together, 
these responses lead Hahn to speculate that libraries' par- 
ticipation in packages indicates that they "may be making 
the best of a bad situation." 26 The survey further shows 
that fifty respondents have had one or more cancellation 
projects for the subscription years 2004-6, and 66 percent 
of these fifty respondents have protected packages from 
cancellation. Hahn notes that the implication of this is that 
other portions of the respondents' collections have suffered 
more significant cuts. Ultimately, the author argues that the 
survey's results demonstrate that publishers should offer 
packages with terms and pricing structures that are more 
accommodating to the needs of libraries. 

The OA Movement 

The OA movement, which aims to make research freely 
available online, constitutes a central effort to transform 
scholarly communication. Although the body of literature 
discussing and debating the OA movement extends outside 
the boundaries of serials librarianship, several noteworthy 
publications examine a topic directly affecting libraries' 
serial acquisitions: the correlation between the growth of 
the OA movement and library subscriptions. 

From the results of a survey of 340 librarians, Ware 
concludes that, for the time being, libraries do not generally 
consider the availability of OA content to warrant the cancel- 
lation of subscriptions. 2 ' Among the factors leading to this 
conclusion are that librarians do not see OA content as an 
acceptable or reliable substitute for a subscription. Likewise, 
librarians possess neither an awareness of nor plans to analyze 
the overlap between subscribed and OA content. However, 
Ware also found that 81 percent of respondents believe the 
availability of OA content would be "very important" or 
"important" in forming cancellation decisions. 28 Moreover, 
while 32 percent of respondents assured publishers that they 
should not be worried about cancellations, 54 percent felt 
that it was too soon to make such a determination. Beckett 
and Inger's subsequent survey of 424 librarians portrays the 
OA movement as a greater threat to the continuation of 
libraries' subscriptions. 29 Approximately 40 percent of the 
survey's respondents indicated that they feel it is wasteful 
for a library to subscribe to serials with content that is freely 
accessible online. Citing findings such as these, Beckett and 
Inger conclude that "a significant number of librarians are 
likely to substitute OA materials for subscribed resources, 
given certain levels of reliability, peer review, and currency 



of the information available." 30 

In an editorial appearing in Learned Publishing, 
Anderson echoes the sentiments expressed in the findings 
of Beckett and Inger. 31 He comments that "it is highly likely 
that rational individuals and libraries will cancel subscrip- 
tions to journals whose content is immediately, freely, easily, 
and reliably available at no charge." 32 Some commentators, 
however, foresee the coexistence of subscriptions and the 
availability of OA content. Pinfield, for example, examines 
four possible scenarios for the future of scholarly communi- 
cation and concludes that subscriptions and the OA move- 
ment can be viewed as complimentary models rather than 
competitors. 33 For coexistence to occur, Pinefield believes 
that a number of major changes need to be instituted by 
both OA repository administrators and publishers. These 
changes include 

widespread deployment of repository infrastruc- 
ture, development of version identification stan- 
dards, development of value-added features, new 
business models, [and] new approaches to quality 
control and adoption of digital preservation as a 
repository function. 34 

Acquisition and Ownership 

The OA movement is not the only threat to established 
acquisition models. As Anderson states, "The arguments for 
traditional collection development are losing their strength 
with every passing day." 3 Competing with these traditional 
arguments are models focused on acquisition of access with- 
out ownership. Carroll and Brink describe a project at the 
University of New Hampshire (UNH) Library that exem- 
plifies this trend. 36 Beginning in August 2003, UNH opted 
to meet users' growing access needs through a document 
delivery service rather than the initiation of new subscrip- 
tions. The authors deem the project a successful strategy 
for reducing expenditures and comment that UNH hopes 
to cancel little-used and high-cost subscriptions and instead 
provide access to these serials through a document delivery 
service. 

Offering further evidence of libraries' exploration of 
nontraditional acquisition models are articles that have been 
written to assess the full-text access that aggregated data- 
bases provide to serials in specific disciplines. 3 ' Together, 
these articles suggest a growing interest in leasing con- 
tent through aggregated databases (which typically do not 
ensure perpetual access) rather than owning the content 
through a subscription with perpetual access provisions. 
Stemper and Barribeau document the trend toward acquir- 
ing access without ownership in an article that received the 
2007 Best of LRTS Award. 38 The authors' literature review 
and informal survey suggests that more than 80 percent of 
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research libraries will enter into an agreement regardless of 
whether the agreement ensures that the access acquired is 
perpetual. 

In an article that received the 2007 Blackwell Scholarship 
Award, Atkinson asserts that this willingness to acquire 
access without ownership represents "the greatest single 
failure of research libraries in the past decade." 39 Several 
publications advocating that libraries secure perpetual 
access rights reflect this perspective. In their analysis of fifty 
serial and aggregator license agreements entered into by 
the University of Minnesota, Stemper and Barribeau found 
that a majority of these agreements (64 percent) include 
provisions for perpetual access. 40 Although these provisions 
often included loopholes, vague wording, and specifications 
of additional fees, the authors nevertheless deem their find- 
ings heartening. However, they temper their optimism by 
emphasizing that publishers' willingness to grant perpetual 
access rights is only of value if libraries pursue these rights. 

Kenney and colleagues further stress the importance of 
securing perpetual access. 41 Drawing on interviews in which 
they assess archiving concerns voiced by fifteen library 
directors, the authors analyze twelve archiving programs. 
The conclusions derived from this analysis convey a sense of 
urgency. Kenny and colleagues state that 

current license agreements are inadequate to pro- 
tect a library's long-term interest in electronic 
journals, that individual libraries cannot address 
the preservation needs of e-journals on their own, 
that much scholarly e-literature is not covered by 
archiving arrangements, and that while e-journal 
archiving programs are becoming available, no 
comprehensive solution has emerged and large 
parts of the e-literature go unprotected. 42 

In light of this finding, they recommend that libraries, 
publishers, and archiving programs strive to enhance com- 
munication, coordinate efforts, advocate change, and make 
meaningful commitments to participating in initiatives. 
Publications describing these initiatives are reviewed in the 
"Initiatives" section of this paper. 



Access 

Issues related to access were a focal point in the 2006-7 
serials literature. Perhaps the broadest contribution on this 
topic is O'Hara's analysis of the results of a 2005 survey 
assessing how 145 academic libraries make their e-serials 
accessible. 43 The survey's findings suggest that libraries are 
generally relying on three access points: online catalogs, link 
resolvers (included Web-based lists powered by link resolv- 
ers), and metasearch engines. 



Online Catalogs 

One important conclusion derived from O'Hara's survey is 
that libraries have not reached a consensus as to the best 
strategies for providing access to serials within the online 
catalog. 44 Perhaps more than anywhere else, this is apparent 
in libraries' varying decisions regarding whether different 
versions of a serial (e.g., electronic, print, and microform) 
should be represented by separate catalog records or a 
single record. In O'Hara's survey, the decisions of respon- 
dents varied considerably, with approximately the same 
number of libraries moving from a single record approach 
to a separate record approach as were doing the opposite. 
According to Allgood, "This multiple versions (MulVer) 
problem represents a defining challenge of the automated 
catalog era." 45 In the author's in-depth investigation of 
the problem, three closely related possibilities for resolu- 
tion are discussed: the replacement of Anglo-American 
Cataloging Rules, 2nd ed., with Resource Description and 
Access; adoption of the International Federation of Library 
Associations and Institutions' Functional Requirements for 
Bibliographic Records (FRBR) model; and utilization of 
Machine-Readable Cataloging (MARC) 21 authority, biblio- 
graphic, and holdings formats. 

Of these three possible resolutions identified by 
Allgood, FRBR constitutes the core theoretical groundwork 
for addressing the MulVer problem. As described by Shadle, 
FRBR is a model that "can be used to support the ability 
of users to find, identify, select, and obtain bibliographic 
resources." 46 Shadle explains that the model represents 
bibliographic resources within a hierarchy consisting of four 
levels: 

• Work: A distinct intellectual or artistic creation 

• Expression: The intellectual or artistic realization of 
a work 

• Manifestation: The physical embodiment of an 
expression 

• Item: A single exemplar of a manifestation 47 

Within this model, multiple versions of a serial can 
be conceptualized as multiple manifestations of a single 
expression. For example, Allgood shows that the New York 
Times can be viewed as a single expression with electronic, 
microform, and print manifestations. As a result, integrated 
library system (ILS) developers have a framework for 
structuring information within online catalog displays that 
facilitates user navigation between multiple versions of a 
serial. Indeed, Allgood believes that an online catalog offer- 
ing users a "tree-like display for works with multiple expres- 
sions or manifestations represents one of the most intriguing 
potential features of the FRBR model for library OPACs." 48 
This statement, in turn, is representative of Allgood's overall 
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contention that the greatest promise for a short-term resolu- 
tion of the MnlVer problem rests in enhancements that ILS 
developers can make to user interfaces. While the realities 
of current bibliographic control dictate that catalogers con- 
tinue "to store and exchange data as cohesive manifestation- 
level description," Allgood asserts that librarians should 
advocate the development of interfaces addressing the 
MulVer problem through enhanced capabilities for record 
indexing and display. 49 

Collins and colleagues offer an example of an effort 
to address the MulVer problem through an enhanced 
online catalog interface. 50 They discuss a project in which 
North Carolina State University (NCSU) Libraries and 
Endeca Technologies collaborated to develop and imple- 
ment Endeca as the user interface of the libraries' online 
catalog. Collins and colleagues explain that the Endeca 
interface has the potential to automatically "connect or 
'FRBRize on the front end'" different manifestations of 
the same serial expression. 31 They add, however, that, while 
the interface could show connections between records, the 
absence of an identifier in the MARC record for a work 
prevents the interface from "display[ing] a hierarchical view 
of the serial work." 52 

An additional barrier to effective serial access within the 
online catalog is discussed in a special section of the Serials 
Librarian featuring four articles examining the relative 
advantages and disadvantages of latest and successive entry 
cataloging. 33 These articles discuss whether cataloging codes 
should retain the convention of cataloging serials accord- 
ing to latest entry, which can force users to search through 
several records to find the one that is needed. As with the 
MulVer problem, these articles look to FRBR and enhanced 
interfaces as possible resolutions. 54 

Link Resolvers 

O'Hara's 2005 survey of 145 academic libraries revealed that 
link resolvers were used as an e-serial access point by 74 per- 
cent of respondents. 55 This finding leads O'Hara to conclude 
that the technology, which can be used to generate Web- 
based serial lists, is "becoming a second library catalogue for 
serials." 56 Apps and Maclntyre discuss how a link resolver 
works, explaining that the technology supports context- 
sensitive linking by enabling a library's authenticated users 
to seamlessly link from a citation in a database to options 
that the library offers for accessing the cited content. 37 
Beyond this core function, articles have explored additional 
roles that a link resolver can play. 58 These additional roles 
include providing data for analyzing users' search patterns 
and generating links from citations in the online catalog and 
free online resources (e.g., Google Scholar, Windows Live 
Academic, and Open WorldCat, now WorldCat.org). 

The widespread implementation of link resolvers has 



resulted in articles that compare and assess specific prod- 
ucts. For example, Livingston, Sanford, and Bretthauer 
describe a project to determine the best link resolver for the 
University of Connecticut Libraries (UCL) through an inves- 
tigation of other libraries' experiences using link resolvers. 59 
Drawing on the results of a literature review, surveys, and 
on-site visits, the authors were able to make in-depth com- 
parisons between three products: Ex Libris SFX, Endeavor 
LinkFinderPftts, and Serials Solutions Article Linker. SFX 
was ultimately selected as being the best fit for the needs of 
UCL. Among the factors leading to this decision were SFX's 
accuracy, flexibility, low maintenance requirements, large 
market share, and detailed reports and use statistics. 

Wakimoto, Walker, and Dabbour assess users' and librar- 
ians' experiences with the SFX link resolver. 60 Working in the 
San Marcos and Northridge campuses of the California State 
University System, the authors conducted online surveys of 
users, focus groups of librarians, analyses of use statistics, 
and test searches. In the case of users' experiences, they 
found that, by a small margin, expectations regarding SFX 
exceeded users' level of satisfaction. Librarians were gener- 
ally satisfied but expressed unease with inaccurate informa- 
tion that SFX sometimes provided concerning accessible 
content. The authors note that, in general, complaints were 
not due to deficiencies of SFX itself but instead involved the 
databases that SFX links to and from. 

The enhancement of link resolvers is the subject of a 
report by Culling, who recommends means of improving 
coordination and communication of information in the 
knowledge bases powering link resolvers. 61 Drawing primar- 
ily on the results of interviews with representatives of the 
various parties involved in managing link resolver knowl- 
edge bases, the author describes the nature of the knowl- 
edge base supply chain and the relationship of the various 
stakeholders in this chain. Culling finds misunderstandings 
and poor coordination throughout the chain and recom- 
mends the development of an organization that "would seek 
to bring stakeholders together to define a visible code of 
practice for effective participation in the knowledge base 
supply chain." 62 Furthermore, the author advocates that 
stakeholders increase their partnerships with subscription 
agents while taking a proactive stance in applying tools for 
the automated exchange of knowledge base information. 

Metasearch Engines 

In O'Hara's 2005 survey of 145 academic libraries, 30 
percent of respondents reported that they make e-serials 
accessible through a metasearch engine, which enables a 
user to search multiple databases simultaneously. 63 The 
nature and effect of metasearch engines as access points 
is the subject of a special section of a 2006 issue of Serials 
Review. 64 A central focus of a number of the articles in this 
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section are the development and features of specific metase- 
arch engines on the marketplace, including SwetsWise 
Searcher, Endeavor Discovery: Finder, WebFeat Express, 
and Muse Metasearch Engine. 65 In addition, this section 
provides guidelines for the selection and implementation of 
a metasearch engine. For example, Highsmith and Ponsford 
discuss Texas A&M University Libraries' implementation 
of Ex Libris' metasearch engine, MetaLib. 66 Tracing a pro- 
cess that extended from fall 2004 through January 2006, 
Highsmith and Ponsford describe the stages of implementa- 
tion, including database testing and configuration, interface 
customization, prerelease user testing, beta testing, and staff 
and user training. 

Lindahl contributes another perspective on the imple- 
mentation of a metasearch engine. 6 The author contends 
that most commercial products' out-of-the-box interfaces 
make the metasearch process more complex and time- 
consuming than necessary. Drawing on the University 
of Rochester River Campus Libraries' development and 
enhancement of its metasearch engine, Find Articles, 
Lindahl offers a case study of how a library can collaborate 
with stakeholders to customize its metasearching capa- 
bilities so that they more effectively meet users' needs 
and expectations. Walker adds to the literature's discussion 
of innovations to metasearch engines by extending focus 
from locally implemented enhancements to industrywide 
standards being developed by the National Information 
Standards Organization (NISO). 68 The author explains that 
the goals of the NISO Metasearch Initiative are threefold. 
These goals are to empower 

• metasearch service providers to offer more effective 
and responsive services; 

• content providers to deliver enhanced content and 
protect their intellectual property; and 

• libraries to deliver services that are distinguished 
from those offered by Google and other free Web 
services. 69 



Management 

As serial collections, acquisitions, and access points are evolv- 
ing, so too are management strategies. The 2006-7 literature 
features an abundance of publications describing how the 
transition to e-serials is leading managers to achieve change 
by enhancing workflows and communication channels. 

Achieving Change 

At the core of managers' efforts at enhancement is an ability 
to achieve change. White explores this topic in a discussion 
of the University of Memphis's implementation of staffing 



changes at the libraries' periodicals desk. 70 Following an 
analysis of different change models, White states that the 
libraries' plan included five steps: "defining the changing, 
creating a common goal, involving the staff, providing an 
opportunity for feedback, and providing an opportunity 
to learn and grow." 1 Ohler contributes an additional per- 
spective.' 2 Drawing heavily on the professional literature, 
she discusses four components to achieving change that 
any manager must grasp: "(1) The information and serials 
environment, (2) organizational structure and culture, (3) 
workflow analysis and staff resources, and (4) the implemen- 
tation and use of technology." 3 A key concept emphasized 
throughout Ohler's analysis is the importance of cultivating 
an attitude of openness in adapting to users' expectations, 
in fostering communication within an organization, and in 
implementing the tools and technologies needed to manage 
e-serials. 

Workflow Analysis and Reorganization 

Managers cannot apply their knowledge of how to achieve 
change without first being aware of when change is needed. 
Yue and Anderson describe how the University of Nevada, 
Reno Libraries increased their awareness on this account 
through the development of a flowchart depicting the 
libraries' workflows for managing e-serials.' 4 They explain 
that, through its illustration of procedures, the flowchart 
has enabled the libraries to identify ways to clarify responsi- 
bilities, streamline operations, and eliminate inefficiencies. 
Graves and Arthur give another example of the benefits 
of analyzing serial workflows.' 5 They discuss a project that 
the Serials Unit of Old Dominion University Libraries con- 
ducted to assess workflows and resource allocations during 
the libraries' transition from print to e-serials. The most 
influential outcomes of this analysis were the establishment 
of a Serials and Electronic Resources Unit and the transfor- 
mation of the titles and responsibilities of two librarian posi- 
tions so that these positions can better coordinate e-serial 
management. 

As libraries have updated their workflows to address 
the challenges of e-serials, the need for traditional, print- 
centered procedures has been called into question. Anderson 
argues that libraries should adopt practices that are more 
representative of users' preferences for accessing serials 
electronically. 6 In doing so, Anderson cites four examples 
of tasks that are not always a prudent allocation of time 
and resources: claiming, binding, subject authority con- 
trol, and unessential customization of records. Borchert 
describes one library's effort to discontinue a fundamental 
procedure in print serial management: check-in." During 
the University of South Florida Tampa Library's migration 
to a new ILS, managers opted to stop routine serial check- 
in. Due to such factors as the arrangement of the library's 
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collection according to the Library of Congress classification 
system and the library's commitment to continue binding 
serials, Borchert reports that the experiment led the library 
to conclude that check-in is still necessary. 

Frost and Woo discuss a similar workflow change, 
this one consisting of the elimination of binding at Hong 
Kong Baptist University Library. 78 Low use of print serials 
combined with increasing subscription and binding costs 
resulted in the authors' recommendation that the library 
discontinue binding all currently received serials that are 
either (1) accessible perpetually online, (2) accessible online 
(regardless of perpetual access provisions) and used less than 
five times per year, or (3) unscholarly newsletters. Instead of 
binding these materials, which constitute over 85 percent of 
the library's currently received serial collection, the authors 
advocate that noncurrent issues be stored in boxes. 

Communication 

E-serials are also changing managers' communication chan- 
nels. Feather explores these changes in a discussion of Ohio 
State University Libraries' analysis of e-resource manage- 
ment communications.' 9 The analysis aimed to develop an 
awareness of the nature, structure, and role of the varying 
types of e-resource communication occurring at the librar- 
ies. Feather reports that this awareness enabled the libraries 
to enhance communication by 

updating and improving online request forms, 
reducing the number of individuals involved in 
certain workflow communications, reducing the 
number of inappropriate messages sent to an 
e-resources unit group e-mail account, spreading 
awareness among other staff about the e-mail clut- 
ter caused by notifying too many individuals of a 
problem, and encouraging library-wide staff view- 
ing of E RMS records. 80 

Other publications shift the focus from internal com- 
munications to communications between libraries and 
their external partners. For example, Robertson reports 
that Strader, Roth, and Boissy presented at the 2005 North 
American Serials Interest Group Annual Conference on 
how libraries can better collaborate with publishers and 
subscription agents. 81 The presenters proposed a checklist 
outlining the responsibilities that each party has in ensuring 
a libraries' e-serial access is activated and retained. 



Initiatives 

The challenges libraries face in the management of their 
serial collections have led to the development of innovative 



partnerships among libraries, publishers, subscription 
agents, and other stakeholders. The initiatives resulting 
from these partnerships are a major topic of discussion in 
the 2006-7 literature. 

Acquisition and Administration 

With the transition to e-serials, acquisition increasingly 
necessitates the negotiation of a license agreement, which is 
a complex task involving a significant investment of time and 
expertise. Hahn describes one effort to simplify this under- 
taking: NISO's Shared E-resources Understanding (SERU) 
Working Group. 82 Through its development of a best prac- 
tices document that both a library and publisher can honor, 
the SERU Working Group offers a pragmatic alternative to 
license negotiations. Hahn explains that by accepting the 
terms of the document, both parties can forgo negotiations, 
thereby streamlining the acquisition process. 

Beyond license negotiations, acquisition and admin- 
istration require that libraries, publishers, and subscrip- 
tion agents exchange metadata regarding serial access and 
availability. Miller and Klemperer discuss how the NISO/ 
EDItEUR Joint Working Party for the Exchange of Serials 
Subscription Information has enhanced this process through 
its development of three Online Information Exchange 
(ONIX) formats: Serials Products and Subscriptions, Serials 
Online Holdings, and Serials Release Notice. 83 Among the 
positive outcomes that libraries can achieve through these 
standards are a reduction in unneeded claims for print 
issues, the automation of URL changes in a library's access 
portals, and the reconciliation of holdings in preparation for 
package deals. 

Following the acquisition of an e-serial, a library must 
effectively record, track, and communicate the business 
and licensing terms. The central tool that libraries rely on 
to complete this task is an electronic resource management 
system (ERMS). While the literature of previous years cen- 
tered on the introduction of ERMS, the 2006-7 literature 
places increased focus on efforts to enhance these systems. 
Fons and Jewell, for example, discuss the second phase 
of the Digital Library Federation's Electronic Resources 
Management Initiative (ERMI). 84 The authors characterize 
the 2004 report resulting from the initial phase of ERMI 
as a "key document for the development of ERMS" and 
explain the ways in which the second phase of ERMI will 
further enhance e-resource management. 85 Among the 
enhancements they cite are a review and update of the first 
phase's Data Dictionary and the facilitation of opportuni- 
ties through which librarians can use this document to map 
licensing terms to ERMS fields. Other focal points include 
the integration of ERMS with ILS, link resolvers, and stan- 
dards for evaluating e-resource use. 

While many of the ERMS available to libraries are 



53(1) LRTS 



From Innovation to Transformation 1 1 



commercial products, other systems have been developed 
by libraries themselves. For example, Meyer describes 
E-Matrix, an ERMS developed by NCSU Libraries, and 
Stranack describes CUFTS, an open-source serial manage- 
ment software system developed by Simon Fraser University 
Library. 86 Discussing the lessons learned from implementing 
a homegrown ERMS, Meyer advises that libraries opting to 
take this path will need personnel with significant expertise 
in both programming and e-resource management. 

Evaluation 

The literature's discussion of the evaluation of serial use 
centers around two initiatives: Counting Online Usage 
of Networked Electronic Resources (COUNTER) and 
the Standardized Usage Statistics Harvesting Initiative 
(SUSHI). Pesch describes COUNTER as a code of practice 
that e-resource access platforms can voluntarily adopt to 
consistently record and exchange a library's e-resource use 
information. 8 ' In a separate article, Pesch discusses how 
SUSHI builds on the COUNTER initiative. 88 He explains 
that SUSHI is a protocol through which COUNTER- 
compliant use statistics can be automatically transmitted 
from e-resource access platforms to a library's ERMS. In 
doing so, SUSHI relieves libraries from the tedious process 
of manually retrieving use statistics. 

The implications of initiatives such as COUNTER 
and SUSHI have been explored from a number of con- 
texts. Analyzing the e-resource use statistics of a large 
research library over a three-year period, Blecic, Fiscella, 
and Wiberley consider the effect of both COUNTER 
and enhancements to users' ability to search and access 
e -re sources. 89 Among the authors' key findings is that, while 
COUNTER has significantly enhanced libraries' ability to 
evaluate e-resource use, enhancements in users' abilities to 
search e-resources redefine the meaning of use statistics. 
Accordingly, they caution that enhancements in e-resources' 
searchability requires corresponding enhancements in the 
measures libraries rely on for evaluating use. 

In a study sponsored by the United Kingdom Serials 
Group, Shepard examines another topic related to the 
success of initiatives such as COUNTER and SUSHI: 
the viability of developing usage factors (UF). 90 The UF 
would offer a means for measuring a serial's quality on the 
basis of use statistics. Describing the results of a survey of 
authors, editors, librarians, and publishers, Shepard reports 
that "there is significant support, even among established 
publishers whose journals perform well in IF [ISI impact 
factor] rankings, for the development and implementation 
of journal UFs." 91 The findings of Duy and Vaughan offer 
further insight on the relationship between e-serials' use 
and IFs. 92 Assessing the use and citations of chemistry and 
biochemistry serials at Concordia University Libraries, the 



authors found that, while there were strong correlations 
between print and electronic use and between electronic 
use and local citation data, there was no correlation between 
IFs and electronic use. 

Archiving 

The 2006-7 literature's most far-reaching analysis of e-serial 
archiving initiatives is a Council on Library and Information 
Resources report authored by Kenney and colleagues. 93 This 
report discusses the results of a survey of twelve e-serial 
archiving initiatives in which representatives of the initia- 
tives were questioned regarding six topics: "organizational 
issues, stakeholders and designated communities, content, 
access and triggers, technology, and resources." 94 Based 
on the responses, the initiatives were evaluated regarding 
their ability to meet indicators for success. These indicators 
concerned each initiative's mission and mandate, rights and 
responsibilities, content coverage, minimal services, access 
rights, organizational viability, and role within a network. 
Key among the report's recommendations are that initiatives 
"should present compelling public evidence that they offer 
at least the minimal level of service for well-managed collec- 
tions" and that they clearly indicate the publishers and hold- 
ings included. 95 Further recommendations involve securing 
guarantees that holdings can never be removed; considering 
the implications of holdings' entry into the public domain; 
and forming a network of initiatives in order to provide 
mutual support, broaden collaboration, and enhance com- 
munication. 

The archiving initiatives receiving the most attention 
in the literature are Portico and Lots of Copies Keep Stuff 
Safe (LOCKSS). Portico is a nonprofit initiative developed 
with support from JSTOR, Ithaka, the Andrew W. Mellon 
Foundation, and the Library of Congress. Fenton, the exec- 
utive director of Portico, describes the initiative's archiving 
strategy as the normalization of the source files contributed 
by participating publishers. 96 This approach aims to facilitate 
the successful migration of the files as new data formats 
replace current formats. Portico grants supporting librar- 
ies access to archived content following designated "trigger 
events" or, in some cases, following a supporting library's 
cancellation of an archived resource. LOCKSS archives 
e-serials using a different strategy. As Seadle states, this 
initiative "offers a community-based rather than a corporate 
approach." 9 ' He expands to explain that LOCKSS consti- 
tutes a network of libraries using the same open-source 
software. This software both archives the source files of 
participating publishers and maintains the integrity of these 
files by comparing the contents of each libraries' LOCKSS 
archive with the archives of other libraries in the network. 
In contrast to Portico, LOCKSS does not normalize source 
files. Due to concerns that normalization may corrupt data 
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and alter content, LOCKSS relies on a bitstream approach 
to archiving that preserves content precisely as it appears 
to users. 



Conclusion 

The 2006-7 literature shows that serials librarianship is in a 
period of great innovation. Propelled forward by user prefer- 
ences, libraries are rapidly transitioning from acquiring seri- 
als in print to providing access electronically. Accompanying 
this transition in the formats of collections are evolving con- 
cepts of seriality and increases in subscription costs. Among 
the outcomes of these changes are new ideas regarding the 
models through which serials are acquired. Although more 
established models such as publisher packages continue to 
pervade, libraries are demonstrating growing interest in 
alternatives. These alternatives include relying on OA con- 
tent and acquiring access through arrangements that do not 
include provisions for perpetual ownership. Countering this 
latter strategy are voices within the profession that advocate 
the need for libraries to secure perpetual ownership provi- 
sions during the acquisition process. 

Innovations are equally apparent in the literatures 
discussion of serial access, management, and initiatives. 
Online catalogs, link resolvers, and metasearch engines are 
emerging as libraries' primary points for providing serial 
access. For each of these access points, efforts are underway 
to evaluate and enhance users' abilities to search for and 
access content. Meanwhile, managers are achieving change 
by reassessing and restructuring workflows, organizations, 
and communication channels so that they are focused on 
the electronic access and administration of serials. Finally, 
stakeholders throughout the serials landscape are partner- 
ing to develop new initiatives. For example, SERU holds 
promise as a pragmatic alternative to license negotiations; 
COUNTER and SUSHI are enhancing the evaluation 
of e-serials; and archiving initiatives such as Portico and 
LOCKSS are providing mechanisms through which libraries 
can retain perpetual access to their e-serial collections. 

Looking to the future, the literature is sure to reflect 
further innovations in the movement to transform serials 
and libraries. With these innovations will come significant 
challenges to the imaginations of those engaged in serials 
librarianship. For example, the 2006-7 literature shows a 
gulf between some of the alternative models being explored 
for acquiring serial access and the perspectives of commen- 
tators advocating the need to secure perpetual access provi- 
sions. Publications aiming to both clarify and reconcile these 
differences between the need to meet users' expectations for 
expansive e-serial access and research libraries' traditional 
commitment to retaining ownership of their collections 
would be welcome additions to the professional literature. 



Also of value to the professional literature would be 
more publications examining the wider effect of the transi- 
tion to e-serials on libraries' organizational structures and 
tools for providing and managing e-serial access. Indeed, 
while the 2006-7 serials literature includes numerous con- 
tributions discussing the implementation of specific tools 
and tasks related to e-serials, the literature includes rela- 
tively few publications addressing the large-scale implica- 
tions that the centrality of e-serials is having on libraries. For 
example, the literature would be enriched by publications 
describing how the transition to e-serials has led to larger 
changes in the organization of departments and workflows 
and in the overall infrastructure of tools libraries rely on 
to manage and provide e-serial access. The 2006-7 serials 
literature's focus on specific tools, projects, and procedures 
likely will serve as a springboard for future contributions to 
the literature that explore the broader effect of innovations 
in serials librarianship. 
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This paper was originally conceived for 
a special edition celebrating the fiftieth 
anniversary of LRTS. For that reason, the 
scope of the paper is limited to works 
published in LRTS. During the period 
covered in this discussion, cataloging 
code reform was discussed in many 
other prominent library journals as well 
as in LRTS. The bibliographies of pieces 
cited in this paper point to works pub- 
lished in Journal of Cataloging and 
Classification, Library Quarterly, Library 
Trends, Annals of Library Science, Journal 
of Documentation, Library Association 
Record, and College and Research 
Libraries, for example. Additionally, the 
discussion of cataloging code reform 
was not limited to libraries using the 
Anglo-American cataloging tradition; 
considerable debate— influenced by 
librarians such as Eva Verona not men- 
tioned in this paper— occurred in librar- 
ies using European cataloging traditions 
during the 1950s and 1960s. Those inter- 
ested in cataloging code reform also 
may wish to explore the history of cata- 
loging outside the Anglo-American tra- 
dition. 



Criticism of Cataloging 
Code Reform, as Seen 
in the Pages of Library 
Resources and Technical 
Services (1957-66) 

By Steven A. Knowlton 

The history of cataloging rules is often written as a story of continuous improve- 
ment toward a more rational and efficient code. Not all catalogers, however, have 
been in agreement that reform of the cataloging code has been improvement. The 
debate of the 1950s and 1960s over cataloging code reform, hosted in part by 
LRTS, is an example of conflicting values in the cataloging community. Seymour 
Lubetzky's proposal for a cataloging code based on logical principles eventually 
became the Anglo-American Cataloguing Rules, but many catalogers of the period 
felt that other values, such as tradition and the convenience of the user, also 
deserved consideration in the cataloging code. 

The library historian Wiegand has said, "We are all prisoners of our own dis- 
courses," meaning that the stories we tell about ourselves influence our views 
of our place in culture and society. 1 For librarians in the United States, that means 
that they often consider their institutions "cornerstones of the communities they 
serve" because "free access to the books, ideas, resources, and information in 
America's libraries is imperative for education, employment, enjoyment, and self- 
government." 2 

What librarians tell themselves and each other about their professional values 
plays an important part in how they perceive their own history. Many librarians 
view the library as an institution that has been instrumental in moving society 
toward "modernity, progress, and science." 3 Whether the values of modernity, 
progress, and science are appropriate values to guide librarianship goes unques- 
tioned by librarians, for the most part. 

A similar discourse is evident in discussion of the history of Anglo-American 
cataloging codes. Wynar and Taylor have stated that the current cataloging code, 
Anglo-American Cataloguing Rules, is "the result of a progression of ideas about 
how to approach the cataloging process in order to prepare catalogs that provide 
the best possible access to a library collection." 4 Chan has written that earlier codes 
were "pedantic, elaborate and often arbitrary." 3 These ideas were introduced in 
basic cataloging textbooks in 1985 and 1994, and such thinking dominates histori- 
cal discussion of the efforts of the 1950s and 1960s to reform the cataloging code. 
Inspection of the written record of the cataloging profession, however, indicates 
that the view of the Anglo-American Cataloguing Rules as an improvement over 
then-current cataloging codes was not universally shared. 

The pages of LRTS abound with debate over the cataloging code, and in 
celebration of the fiftieth year of LRTS, this paper seeks to demonstrate how the 
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reform of the cataloging code was accompanied by many 
divergent voices, whose claims may help reframe the dis- 
course about the history of cataloging. The discussion about 
cataloging code reform was not only a technical debate over 
the merits of various methods of entry, it was also a multi- 
layered debate about the values that should prevail in the 
cataloging profession. At one level was the question of cost 
in time and money to revamp the existing catalogs — and in 
the cost to scholarship of retraining the research community 
in the use of the catalog. At another level was the question 
of whether the admittedly important value of logic should 
prevail completely over other values that had motivated 
earlier framers of cataloging codes, such as tradition and the 
convenience of the user. The latter term, as used in defense 
of retaining the former cataloging code, generally referred 
to the practice of entering a heading where a reasonable 
user was presumed to be likely to look for it — "the public's 
habitual way of looking at things." 6 

While librarians know today that Seymour Lubetzky's 
vision of a logical, principled cataloging code did indeed 
prevail, considerable dissent met the notion that his way 
was, in fact, the best way to prepare catalogs. As catalogers 
are today working on yet another round of cataloging code 
reform, a useful exercise for today's catalogers may be to 
review the debates of the past. In this way, the debate may 
travel outside the discourses that have dominated think- 
ing about cataloging since the adoption of the first Anglo- 
American Cataloguing Rules (AACR) in 1967. ' 

The First Century of Cataloging Codes 

Since Antonio Panizzi published the "91 Rules" for com- 
piling the book catalog of the British Museum in 1841, 
cataloging codes have been in a continuous state of change. 8 
Charles Coffin Jewett adapted most of Panizzi's rules for the 
Smithsonian Institution in 1850, Charles A. Cutter devoted 
several decades of the second half of the nineteenth century 
to developing Rules for a Dictionary Catalog, and Melvil 
Dewey's Library School Rules (1888) reflected his work 
directing the Columbia School of Library Economy. 9 In 
addition, codes by Klaus August Linderfelt and Frederick B. 
Perkins and a pamphlet of suggestions by the Library Bureau 
were also in circulation in the late nineteenth century. 10 

Contemporaneously, a committee (which included 
Cutter) of the American Library Association (ALA) pre- 
pared a "Condensed Rules for an Author and Title Catalog" 
in 1883. 11 However, within a couple decades, the rules had 
not prevented "considerable divergence in the practice 
even of libraries organized subsequent to 1883. " 12 Between 
1901 and 1908, a second committee (again including 
Cutter) worked to develop a revised cataloging code "to 
bring about uniformity between its revision of the A.L.A. 



Rules, the 4th edition of Cutter's Rules for a Dictionary 
Catalog . . . and the Library School Rules." 13 The com- 
mittee also worked with the Library Association in Great 
Britain to harmonize the cataloging codes used in the 
United States and in the United Kingdom. The resulting 
Catalog Rules: Author and Title Entries (American edition) 
and Cataloguing Rules: Author and Title Entries (British 
edition) were jointly adopted by the ALA and the British 
Library Association in 1908. 14 

The necessity for all libraries to adopt a shared set of 
cataloging rules had become steadily more apparent as 
early union catalogs were created, and had taken on added 
urgency in 1901, when the Library of Congress (LC) issued 
printed catalog cards for titles it had received. 15 As librar- 
ies across the country took these printed cards into their 
catalogs, their locally cataloged materials required entry 
and description according to the same rules as the titles 
cataloged at LC. Hence, the adoption of the 1908 rules was 
achieved after only a short review period, and with near- 
unanimity between the two largest library associations in 
the English-speaking world (separate British and American 
editions were issued, but with only minor differences in the 
rules). It was the first set of cataloging rales to achieve wide- 
spread acceptance in libraries in the United States. 16 

The 1908 code, and each code that followed, was lim- 
ited to rules for descriptive cataloging. Although some of the 
earlier codes, including Cutter's, included rules for subject 
entry, an English-language subject cataloging code for uni- 
versal application has not yet been developed as of 2008. 

After 1908, the LC introduced many changes and addi- 
tions to the rales on an ad hoc basis, to address cataloging 
issues not covered by the 1908 Catalog Rules: Author and 
Title Entries. These changes and additions were issued to 
libraries that subscribed to the LC's catalog cards, but "in 
the absence of any supplementary rules from the American 
Library Association since 1908, libraries . . . had to formu- 
late their own rales, relying chiefly for guidance on rales 
issues occasionally by the Library of Congress, added to 
such deductions relating to practice as could be made from 
the printed cards as examples." 17 By 1930, librarians felt a 
need for a revised code to incorporate the LC's revisions 
and reduce local variation in cataloging, so work began on 
an updated set of cataloging rules. 18 The motivating idea for 
the revisers was the feeling that the 1908 rules had not been 
extensive enough, so that the revised rales would cover more 
circumstances that proved troublesome to catalogers — such 
as serials, anonymous classics, music, maps, pseudonymous 
works, and corporate authorship. The coverage of such fine 
details meant that the 88 pages of the original Catalog Rules: 
Author and Title Entries became 408 pages in the revision 
published in 1941. 19 Furthermore, most of the justification 
for the rales came from prior use, or "precedent," rather 
than any logical reasoning; many rales had exceptions; 
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and inconsistencies in the treatment of different types of 
material were noted. The structure of the rules, consist- 
ing of compound complex sentences with few illustrative 
examples, made the application of the rules difficult. 20 

The complexity of the 1941 edition (which applied only 
in the United States, as British librarians could not partici- 
pate in the revision due to World War II) lead to dissatisfac- 
tion in some quarters. Osborn published a famous article, 
"The Crisis in Cataloging," in 1941, in which he lamented 
the large backlogs in cataloging departments and predicted 
that an even more complex set of rules would further slow 
down catalogers — an irony in light of the new rules' purpose 
of easing catalogers' work by providing rules for more types 
of publication and issues of entry. 21 

Cutter had attempted to generate his rules according to 
the objectives of a catalog: namely, to allow the user to find a 
book, to show what the library has, and to assist in the choice 
of a book. 22 However, the 1908 rules and their subsequent 
revisions had excluded the statement of these objectives. In 
response to the criticism of the 1941 rules, the ALA com- 
missioned another revision (A.L.A. Cataloging Rules for 
Author and Title Entries) to simplify the rules and arrange 
the presentation so that the principles behind the rules 
would be more apparent. 23 This 1949 revision also elimi- 
nated rules for description of the book; only rules for entry 
were included. Despite these changes, the 1949 revision was 
also criticized for its complexity and unwieldiness. 24 

Although the 1949 ALA cataloging rules had omitted 
rules for description, the LC published its own Rules for 
Descriptive Cataloging in the Library of Congress in 1949; 
the rules were originally prepared for internal use, but were 
published for the wider library community in order to pro- 
vide guidance for librarians using catalog cards printed by 
the LC. 25 The 1949 ALA rules for entry and 1949 LC rules 
for description became known, respectively, as the "Red 
Book" and the "Green Book" from the colors of their bind- 
ings. The Rules for Descriptive Cataloging at the Library 
of Congress (RDC) were considerably simplified from the 
1908 and 1941 codes, and were largely praised for this fact. 

Seymour Lubetzky's Cataloging Rules 
and Principles 

In light of the praise for RDC and the less positive reception 
of the ALA rules for entry, the Board on Cataloging Policy 
and Research determined to approach rules for entry in the 
same fashion that the LC's rules for description had been 
developed: namely, "prepare the simplest code of descrip- 
tive rules which could meet the established needs." 26 To 
begin the work of preparing the simplest code, the ALA 
engaged the services of Seymour Lubetzky, a librarian at 
the LC who had also worked on the RDC. Lubetzky first 



prepared a critique of the 1949 ALA rules for entry, called 
Cataloging Rules and Principles: A Critique of the ALA 
Rules for Entry and a Proposed Design for the Revision. 21 
Lubetzky's critique not only pointed out the flaws in the 
existing rules for entry, but laid out the need for establish- 
ing a set of principles from which an improved code could 
be built. It included his famous question, "Is this rule nec- 
essary?" to which the answer was often "no" because the 
determination of the form of heading or rule of entry could 
be discerned from a larger principle, without need for a 
specific rule. 28 

According to Tillett, Lubetzky felt that the cataloging 
rules had become so complex because catalogers had lost 
sight of the reason for the catalog: to help users identify and 
distinguish among works that meet their needs. 29 Cataloging 
rules that expressed the principles defined by Cutter (and 
refined by Lubetzky) would of necessity be simpler, and 
would allow catalogers to create better catalogs. 

By 1954, the ALA had decided to prepare a complete 
revision of both the Red and Green Books, and appointed 
a Catalog Code Revision Planning Committee to the task 
of overseeing and advising Lubetzky's drafting of a revised 
code. 30 Over the next decade, many discussions about the 
revised code were held in symposia and in the pages of 
journals. Almost all discussion focused on the approach 
to cataloging presented by Lubetzky, whose work became 
the sine qua non of the new cataloging code. As Dunkin 
wrote in 1959, "The genius of Seymour Lubetzky now 
dominates our thinking about the catalog as completely as 
Cutter once did." 31 



LRTS as a Forum for Debate 

In this environment of serious contemplation of the prin- 
ciples by which works should be cataloged, LRTS was 
launched in 1957. Debate over cataloging code reform was 
not limited to the pages of LRTS, but the pieces presented 
in that journal form a useful record of the voices for and 
against reforming the cataloging code along Lubetzky's plan. 
Although both pro- and anti-reform articles appeared in 
LRTS between 1957 and 1966, this paper concentrates on 
articles composed by librarians who had reservations about 
the Lubetzky code, as they expressed a concern for values 
that have been considered of less importance than those that 
motivated AACR. Because the articles discussed Lubetzky's 
proposed code on its merits, a variety of perspectives 
(including some commendation of aspects of the proposed 
reform, as well as reservations about the changes) can be 
traced through the pieces under consideration. 

LRTS in its first decade was not the research-oriented 
journal it is today. Rather, it was a forum for news and debate 
over the latest trends in technical services librarianship. 32 
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Articles were frequently fewer than five pages long, and 
included reports from various ALA committees, opinion 
pieces, and even humor. Because of its nature as a pro- 
fessional round table, LRTS provided an opportunity for 
librarians to voice their concerns about developments in the 
revision of the cataloging rules outside of the formal struc- 
ture of a research article or literature review. Throughout 
the 1950s and 1960s, discussion of current cataloging issues 
was published regularly. 

The very first volume of LRTS, published in 1957, fea- 
tures a lengthy review of a symposium held at the University 
of Chicago in June 1956, called "Toward a Better Cataloging 
Code: A Beview." 33 An "unusually large" number of attend- 
ees (148) testified to the interest in the revision of the cata- 
loging code among librarians, and the concerns voiced by 
some of the speakers foreshadowed the debate that would 
follow for the next decade. 34 While a number of speakers 
expressed enthusiasm for Lubetzky's proposals to return 
to the basic principles of cataloging, most were concerned 
about the cost of recataloging items already entered. Angell 
delivered a more philosophical demurral. 35 Where Lubetzky 
wished to do away with all entries other than author and 
title, Angell preferred to retain form headings (for example, 
"Laws, statues, etc." or "Anonymous classics") as a natural 
entry (that is, an entry that a user would think to look under 
using his or her native intelligence). Angell also raised the 
point that both the Bed (ALA rules for entry) and the Green 
(LC rules for description) Books needed to be revised 
despite the general acceptance of RDC because the choice 
of entry influences how the name of the author may be 
described and because the descriptive rules should include 
provisions for media other than books. Osborn also urged 
that the new code seek to achieve harmony with codes of 
other countries. 36 Henkle (Lubetzky's former supervisor at 
the LC) raised the issue of user studies; some librarians felt 
that data from the observation of nonlibrarian catalog users 
should influence the code. 37 All these issues would continue 
to be important topics in LRTS for the next two decades. 

Another early article in LRTS supported Angell's pro- 
posal to revise rules for description along with rules for 
entry. "The Bed and the Green" by Waters of Georgetown 
University used a sample of publisher statements (of the 
RDC) to demonstrate the difficulty in determining proper 
description of that field according to RDC. 38 Waters felt that 
a review of the principles and goals that descriptive rules 
served should accompany the review of principles for entry, 
and that both set of rales should be revised simultaneously. 

The Draft Code and Its Discontents 

By 1958, Lubetzky had prepared a draft of a revised catalog- 
ing code, which was discussed by more than 175 librarians 



at the "Institute on Catalog Code Bevision" at Stanford in 
July 1958. 39 As promised in his earlier works, Lubetzky laid 
out the objectives of the catalog as the first statement of the 
code: "1) To facilitate the location of a particular work; and 
2) To relate and bring together the works of an author and 
the editions of a work." 40 The similarity to Cutter's objectives 
was noted — but Lubetzky had done away with another of 
Cutter's principles: serving "the convenience of the pub- 
lic" (in the sense of deferring to the searching practices of 
users). 41 This ambiguous phrase had led to many of the awk- 
ward, contradictory, and unintuitive rules in the 1941 and 
1949 codes, such as entering certain types of corporate body 
under their location and the use of form headings. Instead 
of "the convenience of the public," Lubetzky relied on logic 
in the observation that a simple rale, strictly followed, will 
become apparent to the catalog user and therefore serve 
him or her better than a maze of unexplained and incon- 
sistent rales with ad hoc exceptions for particular circum- 
stances. In this way, it was believed that the convenience 
of the public was served more effectively 42 To achieve the 
stated objectives, Lubetzky insisted on main entry under a 
name or tide. No entries under location or form were to be 
made. Lubetzky's draft code also addressed the contentious 
issue of corporate authorship by calling for entry of serials 
titles and corporate bodies that changed name under their 
successive names. A number of critics felt that this policy 
undermined the second objective. 43 

According to a report on the Stanford Institute, which 
was the first public discussion of the draft code, a number 
of attendees questioned the value of Lubetzky's second 
objective ("to relate and bring together the works of an 
author and the editions of a work"). 44 Many at the institute 
felt the draft code promoted excessive cross-entry, requir- 
ing more complex rather than simpler rules for entry. 
Wright questioned whether the code should consider sub- 
ject entry as well. 4 The issue of the cost of converting the 
catalog to a new code was raised, along with the necessity 
of international cooperation on cataloging rales. However, 
the institute achieved a consensus on the notion of prepar- 
ing the best code and then finding methods to achieve 
cost savings or international agreement afterward as the 
most productive approach. Further issues were raised, 
but left unresolved. These would continue to occupy the 
minds of catalogers as revision continued — the problems 
of corporate author entry (which circumstances require 
corporate, rather than personal, authorship; under suc- 
cessive or latest name; under subdivisions; under location) 
and serial title entry (successive versus latest title). During 
this period, Lubetzky wrote an article for LRTS explaining 
the process of code revision and his own reasoning behind 
the principles and rales used in the code. 46 After this, the 
task of defending the code in the page of LRTS against its 
critics fell to other writers. Following the Stanford Institute 
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(and in preparation for another held in Montreal at McGill 
University in 1960), Lubetzky prepared yet another draft 
of the code. 4 ' In response, the pages of LRTS featured 
many critiques of the proposed rules changes, as reviewed 
below. 

Concerns about Corporate Authorship in 
Lubetzky's 1960 Draft Code 

Although some of the issues debated were of a technical or 
practical nature, the issue of corporate authorship was one 
in which the values of the Lubetzky code stood in strong 
contrast to the values of the earlier codes. In particular, logic 
was pitted against tradition and user convenience, the latter 
referring to the sense that a catalog should have entries for 
corporate bodies where a user would look for them. 

Draper of the University of California, Berkeley was 
dismayed that discussions of cataloging code reform (and the 
1960 Draft Code) had not sufficiently addressed the problem 
of determining under which circumstances an entry should 
be made under a corporate author as opposed to a personal 
author. 48 He found the rule for entry under corporate body 
to be "highly vulcanized, i.e., full of rubber which can stretch 
in any direction at will," because the wording of the rule 
allowed for much latitude in interpretation. 49 

Haskins of Harvard University defended the to-be- 
discarded rules requiring entry of local or civic institutions 
under place by referring again to the convenience of the 
user: 

From the standpoint of the use of the catalog the 
most direct approach would appear to be by the 
place where [the institutions] are located. Also, 
there would seem to be a real advantage in bringing 
together the schools, hospitals, churches, muse- 
ums, etc., that generally may be of slight interest 
individually, but which play such a large part in 
the life of a city. If an institution bears a name that 
has little significance without the place where it is 
located, whether it be the Free Public Library, the 
First Church, Unitarian, Saint Paul's Church, or 
Saint Luke's Hospital, is it not logical to record it 
under the name of the place? 50 

Implementation of the Draft Code in 
Imagination and Experiment 

The Summer 1961 (vol. 5, no. 3) issue of LRTS featured a 
series of articles on the effects that implementing the 1960 
draft code (now called Code of Cataloging Rules: Author 
and Title Entry, an Unfinished Draft, or CCR) would have 



on the operation of libraries. Dunkin of Rutgers presented 
an overview of the changes catalogers would have to make 
in the switch from the 1949 rules to CCR. 51 He called it 
"Guesstimates Unlimited," but only pointed out three major 
areas that would require significant changes in the form of 
entry: the use of a uniform title following a personal name 
main entry (a new idea first proposed in CCR); the elimina- 
tion of the distinction between "institutions" and "societies" 
among corporate bodies, and the entry of all corporate bod- 
ies under name rather than place; and entry of anonymous 
works under title, rather than form. Dunkin offered sugges- 
tions for adapting the catalog to the new rules, such as using 
guide cards to provide cross-references from the older form 
of entry to the CCR form. 

Wright of Williams College presented the results of a 
survey of catalogers who were asked to examine entries cur- 
rently in their catalogs and determine if CCR would require 
changes in form of entry. 02 Under the rules 70 percent 
of headings would remain unchanged, 13 percent would 
require minor changes, and 17 percent of headings would 
be different. Most respondents reacted favorably to the new 
rules as "more explicit, more reasonable, and easier to use," 
although some expressed reservations about making such a 
large number of changes. 53 

Haskins wrote — on behalf of the librarians at Harvard — 
in defense of many of the old ALA rules, including form 
headings and entry under place for corporate bodies, "What 
is to be gained by giving up this type of heading which has 
been in use over a long period and is generally understood 
and liked?" 34 She also found much to object to in the impo- 
sition of new rules, such as uniform titles combined with 
author main entry, changes in the form of foreign names, 
and successive entry for corporate bodies that change 
names — mainly on the grounds of the need to revise and 
update thousands of catalog cards, with little gained (in the 
opinion of Harvard's librarians). She concluded with several 
thoughts about the flurry of cataloging rules changes that 
had come in the 1940s and 1950s: 

I am beginning to wonder if we, as librarians gen- 
erally and as catalogers specifically, know what 
we really want in the way of a cataloging code. 
We became dissatisfied with the 1908 code. For 
one thing it was too general. So a large commit- 
tee made up of extremely able people worked for 
many years to revise the rules. The result was a very 
detailed code. In that respect it should have been 
the answer to a catalogers prayer. Perhaps it was 
for many. But within a short time, even before the 
second (1949) edition was published, it was on the 
carpet and was severely criticized for its complex 
rules, when the trend was toward simplification, for 
its lack of organization, its lack of basic principles, 
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and so on. So once again we set to work. This time 
we started from scratch. . . . But from there on have 
we gone far enough or have we gone too far? Are 
we going to be successful this time? . . . We also 
started this revision by shutting out the past, clos- 
ing our eyes to all the water that had gone over the 
dam. We have now come to the point where we can 
no longer disregard what has gone before. . . . How 
much can the large research library afford in order 
to implement rules that call for so many changes 
in practice? 55 

Brown, of the Free Library of Philadelphia, wrote with 
concerns about the rules requiring uniform titles for works 
that appear under various names. 56 She preferred the entry 
as it appears in the work, whether it is title, corporate name, 
or personal name. Although it would create a "mongrel cata- 
log," her opinion was that users would be better served (par- 
ticularly in a large public library) by reducing the number of 
"two-step searches," which would be caused by the creation 
of uniform headings (step one was finding the proper head- 
ing from cross-references, step two was searching under 
that heading — a lengthy process when using a large card 
catalog). ' Further, she found that a rigid application of prin- 
ciples should give way to a consideration of user behavior: 
"The Nibelungenlied, whether considered from the point of 
view of bibliographical characteristics or from the point of 
view of use, differs significantly from a recently published 
government document on jet propulsion. Consistency is 
a virtue in developing a catalog, but . . . [i]t need not be 
interpreted to mean that the same policy must be applied 
to all material regardless of that material's bibliographical 
characteristics." 58 

Hines of Butgers wrote with concerns about Lubetzky's 
use of the term "work" instead of "book": 

The implication is that the work is to be considered 
as an intellectual rather than as a physical entity. . . . 
This distinction between the physical and intel- 
lectual cannot be pushed too far. It is clear that 
. . . Lubetzky does not mean that we should have 
a single main entry for Nine Plays of Bernard 
Shaw which would file with editions of Caesar and 
Cleopatra issued as physically separate bibliograph- 
ic units. ... It is here that a qualifying phrase seems 
to be needed in the draft code. It would appear that 
the code tacitly accepts the long-existing premise 
that the cataloger deals with physical bibliographic 
units, and that he catalogs them as such. . . . This 
preference for the physical bibliographic unit in 
cases of conflict [with intellectual units larger or 
smaller than the physical units] should be explicitly 
stated in the code. 59 



Beckman reported the results of an experiment at the 
University of Waterloo in which CCB was used to catalog 
new acquisitions. 60 Although she found the "revised code 
a pleasure to work with," and noted the ease with which 
her catalogers now addressed the names of authors, she did 
describe some difficulty in applying the rules for works of 
changing authorship, such as yearbooks and dictionaries. 61 
"The most difficult problem with this rule is that it is impos- 
sible to tell when handling a first edition of a reference work 
whether or not it will go into successive editions." 62 As well, 
the rules in this section diverged so far from current LC 
practice that Waterloo was unable to use, even in modified 
form, printed cards from the LC for those titles. 

The Paris Principles 

All such criticisms of CCB would no longer be addressed 
by Lubetzky; in 1960, he left the employ of the LC and 
accepted a professorship at the University of California, Los 
Angeles. The Catalog Code Bevision Planning Committee 
turned over the job of editing CCB to C. Sumner Spalding. 
Lubetzky made one more important contribution to the 
revised code in the form of his role in formulating what 
became known as the Paris Principles. 63 

As Osborn and others had noted, the American cata- 
loging code revision was taking place during a time when 
librarians in other countries were also contemplating cata- 
loging code reform. The destruction of many libraries in 
Europe during World War II made the possibility of revis- 
ing cataloging codes more feasible because the number of 
books requiring recataloging was much reduced. 64 Although 
the possibility of international agreement on cataloging 
rules had been explored at the International Congress of 
Archivists and Librarians at Brussels shortly after the pub- 
lication of the 1908 rules, those in attendance determined 
that differences between Anglo-American and continen- 
tal (particularly German) rules were too great. 65 During 
the 1950s, a number of library associations — including 
those of France, Poland, Japan, Spain, Italy, Switzerland, 
the U.S.S.B., and India — worked on revised cataloging 
codes. The Library Association (of the United Kingdom) 
determined that it would work with the ALA so that the 
revised code being prepared by Lubetzky would be Anglo- 
American. In light of these developments, the International 
Federation of Library Associations (IFLA) convened con- 
ferences in 1958 and 1959 to discuss the possibility of an 
international agreement on cataloging principles. The result 
of these discussions was the IFLA International Conference 
on Cataloguing Principles (ICCP), held in Paris, October 
9-18, 1961. 

In Paris, representatives from thirty-four national 
library associations met and agreed on the Paris Principles, 
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which served as the basis for future cataloging codes in most 
countries. Lubetzky's contributions included articulation 
of the principle of main entry. Another important point of 
agreement was the principle of corporate authorship, which 
had previously not been observed in Germany. 

The importance of the Paris Principles to catalog code 
revision was that revision could go forward with an inter- 
nationally accepted set of principles underlying it and also 
provide strict guidelines. As Kebabian of the University 
of Florida commented, "There is no doubt that American 
librarianship will be under world-wide scrutiny as our new 
code reaches completion." 66 As with Lubetzky's code, the 
Paris Principles stirred up some criticism. For example, Scott 
of the University of Oklahoma found some of the guidelines 
to be too vague: "Consistent catalog entries for current 
materials cannot depend on 'best known' or 'most frequently 
used' form. Such criteria are useful only in retrospect." 67 

The question of whether the proposed Anglo-American 
code would go forward truly based on the Paris Principles 
was another concern. Kebabian noted, 

Though the concept of the corporate author has 
been finding its way during the past ten or more 
years into French catalogs and bibliographies, for 
most national delegations this constituted the most 
fundamental break with tradition, and there was 
considerable debate at Paris before its final accep- 
tance. In the form which was approved, moreover, 
it includes at least two provisions which contradict 
current United States practice. ... It is ironical to 
reflect that these two principles were among those 
suggested by Seymour Lubetzky in his critique, 
Cataloging Principles, and that his studies and the 
preparation of that document stemmed from our 
desire to seek solutions to the inconsistencies of 
the "corporate complex" as structured in some sev- 
enty rules in the 1949 ALA code. While acceptance 
abroad at the ICCP was achieved, at home these 
principles constitute a problem of considerable 
consequence to achieve their reconciliation with 
existing entries in our long-establish, monolithic 
card catalogs; they are the one major source of 
yet unresolved compromise efforts in the current 
preparation of our code of cataloging rules. It is 
thus that the dead hand of history plagues us. 68 

Progress toward AACR 

With the Paris Principles in place, the Catalog Code 
Revision Planning Committee continued to revise the 
cataloging code. An important agreement was settled upon 
at the 1963 ALA Midwinter Meeting in Miami. After the 



LC and the Association of Research Libraries "complained 
that they would be unable to pay the cost of changing the 
headings on cards already in their catalogs if the Committee 
followed the IFLA Paris Statement of 1961 which called 
for the entry of all corporate bodies directly under their 
names," the Committee "decided to say plainly that the 
'institutions' rule is an exception to the Paris Statement 
name-entry principle." 69 Essentially, the parties agreed that 
entry for corporate bodies could continue under place. 
Without that agreement (which Dunkin called the "Miami 
Compromise"), the LC and large research libraries might 
not have adopted the new code, as it would have created 
an immense burden of recataloging. ,0 In 1963, the commit- 
tee decided that rules for description should be revised to 
encompass all media. 

These steps toward completion of the new code may 
have alleviated for some catalogers the weariness with the 
lengthy process of code revision. As Symons wrote in 1962, 

Any cataloging code must be a compromise between 
the principles of consistency and convenience (but 
whose convenience? Surely not the cataloger's). 
There are bound to be areas of conflict. The exact 
place where the compromise is made seems to me 
not to matter very much. Rather than waste several 
more years of time and emotion and inaction, I 
suggest we encourage the publication of a Revised 
Code as soon as possible, so that we can all get on 
with applying in our libraries (or not applying it, if 
we really dislike it heartily). 71 

After the Miami Compromise, the Committee (working 
closely with British and Canadian representatives), labored 
feverishly to prepare the final edition of the rules, which 
was published in early 1967 as AACR. 12 Although the rules 
conformed mostly to Lubetzky's principles, some exceptions 
were present, particularly those involving corporate entry 
under place.' 3 The committee recorded its regret "that, 
because of the great size of many American card catalogs, 
it was necessary for the Catalog Code Revision Committee 
to agree to the suggestions of the Association of Research 
Libraries that certain incompatible American practices be 
continued in the present rules." 74 Lubetzky himself was 
disappointed that AACR omitted a statement of principles, 
on which he had based his draft codes. For the most part, 
the catalogers accepted the new code and found its revisions 
worthwhile and useful. 76 

This does not mean that criticism of the cataloging rules 
ceased in 1967. Indeed, a paper twice this length could be 
written about the critiques of AACR that led to its revision 
in 1978. 
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Conclusion 

Most librarians using AACR today entered the profession 
after the code was published, and therefore accept it as the 
fundamental basis for cataloging. Further, many librarians 
believe that cataloging rules have improved over time so 
that the current rules most closely approach logical, princi- 
pled cataloging. Nonetheless, AACR was controversial in its 
day, not least for the major upheaval it caused to previously 
created catalogs. The process of superimposition followed 
by many libraries in order to accommodate AACR attests to 
the wide-ranging consequences of such a thorough revision 
of cataloging rules. 

The historical view of steady improving cataloging 
codes also feeds the library community's own self-image as 
leaders in "modernity, progress, and science." 77 However, 
many thoughtful librarians working during the days when 
AACR was being developed did not necessarily find the 
principles espoused by Lubetzky to be an improvement over 
then current practices. Some librarians felt that the values of 
tradition and user convenience were being disregarded. 

An appraisal of the record will show that LPiTS served 
as an important forum for discussing just how, why, and 
whether catalog code revision would truly make the catalog 
a better guide to a library's collection, and that the library 
community was far from unanimous in regarding AACR as 
progress. As the cataloging community moves forward with 
revision of the current catalog code, it would be well-served 
by an examination of those values, both stated and unstated, 
that motivate such revision. 
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Comparing Catalogs 

Currency and Consistency of 
Controlled Headings 

By Stephen Hearn 

Evaluative and comparative studies of catalog data have tended to focus on meth- 
ods that are labor intensive, demand expertise, and can examine only a limited 
number of records. This study explores an alternative approach to gathering and 
analyzing catalog data, focusing on the currency and consistency of controlled 
headings. The resulting data provide insight into libraries' use of changed head- 
ings and their success in maintaining currency and consistency, and the systems 
needed to support the current pace of heading changes. 

Much of the work of technical services takes place out of public view. Perhaps 
this explains in part why measures of technical services' contribution to the 
library are relatively lacking in compendiums of library measures. The number of 
volumes and subscriptions in a collection, the rate at which electronic resources 
are accessed, circulations and reference interviews — all of these are frequently 
cited as measures of academic libraries' performance, but rarely is the work con- 
tributed directly by technical services used as a library's performance measure. 
For some in technical services, there might seem to be an advantage to being 
"under the radar" when internal or library-to-library comparisons are done; but 
the lack of measures can also leave any operation unsure of its own success and 
of the validity of any local or shared set of norms. Having practicable methods of 
determining a technical services unit's success in meeting its goals and of assess- 
ing that accomplishment in relation to that of peer institutions can help technical 
services units build confidence in their goals, identify systemic problems, and 
contribute to library planning and priority setting. The study presented here seeks 
to define and test an approach to measuring one of the contributions of technical 
services: the use of consistent and up-to-date headings in the library catalog. 
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Methods of Measuring Catalog Data Quality 

One obvious component of library service is the product of technical services 
efforts: the data in the library catalog. The catalog assists users with finding 
known items in the collection; browsing the collection by subject, author, and title 
headings; browsing the result sets of keyword searches; examining and selecting 
items via their surrogate records; and locating the items desired. These basic ser- 
vices are provided through a wide variety of interfaces and displays. Vendors and 
designers of automated library systems offer a range of interface choices to their 
customers, and each library tailors its system's functionality and presentation for 
its users. Comparative evaluation of the differences between such varied interface 
options would inevitably be complex and highly subjective. In their review of the 
literature on quality in cataloging, Myall and Chambers note the difficulty and 
rarity of high-level evaluation of the catalog: 
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Quality of the overall catalog appears to be less 
frequently the subject of study . . . notwithstand- 
ing the fact that both Cutter's objects and much 
of FRBR's approach are focused on the catalog 
as a whole rather than on individual records. 
Presumably the limited extent of study at this 
level is due to the complexity and multi-faceted 
nature of the task, which now must include not 
only content and structure of the database, but 
also completeness and presentation of data on vari- 
ous screens, search engine execution, presence of 
context-sensitive help, and other elements in an 
environment in which users are familiar with many 
other Web-based information tools. 1 

Nevertheless, behind the variable screens of automated 
system interfaces, the data records that feed catalog indexes 
and displays are highly standardized. The widespread adop- 
tion of a core set of data standards by the U.S. academic 
cataloging community — the Machine-Readable Cataloging 
(MARC) 21 formats for mark up; the Anglo-American 
Cataloguing Rules, 2nd ed., rev, for description and name 
or title access; the Library of Congress' (LC) Subject 
Cataloging Manual: Subject Headings rules for subject 
access; and the LC's Name and Subject Authority Files 
(LCNAF and LCSAF, respectively) for authority-controlled 
headings — has enabled the sharing of catalog records 
through union databases, of which OCLC WorldCat is the 
prime exemplar, and the proliferation of library automated 
systems, all designed in their myriad ways to exploit the data 
contained in standard catalog records. 2 Studies of data qual- 
ity rather than the qualities of automated system interfaces 
can reasonably claim to be focusing on a crucial and compa- 
rable aspect of overall catalog performance. 

Past efforts to evaluate catalog data quality have gener- 
ally relied on thorough review of individual catalog records. 
In their recent survey of the literature on quality in catalog- 
ing, Myall and Chambers found much disagreement over the 
definition of quality. The most common model they found 
for data quality analysis, reported in eight studies, calls for 
selecting a set of catalog records and examining the differ- 
ent areas of each record — fixed fields, standard numbers, 
title and statement of responsibility, edition and publication 
statements, notes, and access points — for errors, inconsis- 
tencies, and omissions. 3 This kind of evaluation tends to 
be time and labor intensive, requiring a significant level of 
expertise and often the retrieval of items from the collection 
for comparison with the catalog records. R also raises com- 
parability issues. Libraries' standards for what constitutes 
an acceptable level of data quality and completeness vary 
across these several areas of description and access. For 
example, the extent to which libraries invest in the creation 
of table of contents or summary notes can vary greatly and 



are a matter of local policy; the trade-off between the added 
value of such notes and the added liability they represent as 
additional opportunities for error makes standard measure- 
ment difficult. 

An alternative for the study of catalog data quality is 
to look not at a sample set of records, but at a sample set 
of searchable data. The "Dirty Database Test" takes this 
approach, offering libraries a set of typographical errors 
to search in the catalog. 4 The number of errors thus found 
does provide a measure of data quality; however, this mea- 
sure tends to lack both context and focus. The prescribed 
typographical error search looks for one or one set (using 
truncation) of erroneous variants, and ignores the number 
of times the term in all its variants is spelled correctly in 
the database. In the absence of these other counts, deter- 
mining an error rate for comparison purposes with other 
catalogs is difficult. Typographical errors, some of which 
may be unlikely objects of a searcher's query, can occur in 
any term in the record. Lastly, as noted earlier, libraries that 
include tables of contents and other notes in their records 
are likely to increase error counts by this measure without 
regard to the overall enhancement of access that such notes 
represent. Searching for and correcting typographical errors 
is an important part of maintaining access, as demonstrated 
by a 2007 study by Beall and Kafadar, but less effective as a 
comparative measure. 5 

Heading Consistency as a 
Comparative Measure 

The aim of the current study is to explore another alterna- 
tive for evaluating catalog data quality. Rather than look- 
ing at whole records for a broad range of error types or 
searching for typographical errors, this study focuses on 
the consistency of selected authority controlled headings. 
Wolverton reports that a commitment to heading con- 
sistency as a goal is widespread among technical services 
departments. 6 In describing their use of the whole-record 
review or "audit technique" at University of Bath, Chapman 
and Massey observe, "Authority control is a valuable form 
of quality assurance which the audit technique is weak in 
evaluating, compared to checking descriptive catalogu- 
ing.'" The authors go on to note that their pilot study "was 
unable to confirm the feasibility of comparing headings to 
an authority file, which would inevitably increase the time 
required." 8 

Departing from the whole-record audit approach to 
focus instead on currency and consistency of controlled 
headings has a number of advantages as a comparative 
quality measure. It highlights heading data, which is of 
high value for discovery. It is less prone to differences over 
catalogers' judgment regarding how a particular resource 
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should be described. It is able to sample a much wider range 
of records than the audit technique. Lastly, it offers a point 
of comparison that most if not all catalog managers would 
acknowledge is valid, given their widespread commitment to 
maintaining current and consistent headings. 

Consistent headings are necessary to ensure that users 
can find all the items they seek under one heading. Many 
heading inconsistencies are obvious to the alert user in a 
browse search, where two similar headings for the same 
name may appear adjacent to each other in the index dis- 
play; in other cases, where the entry term for a name or 
subject heading changes, the two headings may be widely 
separated in the browse index. 

Inconsistencies can be less obvious, and therefore more 
of a problem, in other searching contexts. A keyword search 
may retrieve one of the split headings forms in its result 
set, and miss the other. A redirected search, prompted by 
clicking a highlighted access point in a catalog record's dis- 
play, may find only the records that match that access point 
exactly and exclude the variants. A "faceted browse" display 
of the type exemplified by North Carolina State University's 
implementation of Endeca and Ex Libris' Primo analyzes 
headings and other data present in a result set and presents 
them under such facet headings as "Authors" and "Subjects" 
in order of frequency in the result set. Such an analysis may 
find both current and obsolete heading forms in its result 
set but display only the more frequently used heading in 
its truncated list of facet terms. Because libraries' new- 
est items are the likeliest to carry the first instance of the 
revised form of a name or subject term, they are more likely 
to be low-posted and therefore less obvious in a faceted 
browse display, and possibly omitted altogether. Redirected 
searches are similarly prone to finding only records carrying 
the obsolete form of a heading and missing new resources 
with the current form of the heading if the heading split has 
not been corrected. 

In addition to having a clear effect on the service the 
catalog provides, heading inconsistencies are relatively 
easy to recognize. Shared standards for heading forms are 
already in place. Standard forms of headings are widely 
distributed through the LCNAF and LCSAF, and widely 
relied on throughout the English-speaking library commu- 
nity. Determining the agreement between a library's catalog 
headings and a standard heading form is also relatively 
straightforward and less demanding of expert judgment 
than the analysis of a full catalog record. Standardization 
makes gathering data on multiple libraries' consistent use of 
a sample set of headings with a fair degree of efficiency and 
accuracy both possible and practicable. 

One type of heading that offers an opportune focus 
for this kind of evaluation is a heading that has changed its 
authorized form. Authority files are dynamic, and the con- 
trolled terms for entities and concepts are always subject to 



revision for a variety of reasons. Two lists of changed head- 
ings are readily available. One is the long-standing "Library 
of Congress Subject Headings (LCSH) Weekly Lists," for- 
merly distributed in paper form, now posted online (www 
.loc.gov/aba/cataloging/subject/weeklylists). The weekly lists 
include new, deleted, and changed headings in the LCSAF. 
The changed headings are marked by the text "CANCEL" 
following the old 1XX heading form in the online list. The 
other list is more recent. On February 1, 2006, the LC post- 
ed a revised rule interpretation reversing its policy of dis- 
couraging changes to the authorized form of personal name 
headings simply to add a death date. As of that date, OCLC 
began compiling and posting online lists of established per- 
sonal name headings to which a death data had been added 
in the LCNAF under the title "Closed Dates in Authority 
Records" (www.oclc.org/rss/feeds/authorityrecords/default 
.htm). Together, these two lists are a handy source for sam- 
ples of authorized headings that have changed their forms. 

The hypothesis for this exploratory study is that an 
examination of the results of searching sample sets of 
changed name and subject headings in a collection of cata- 
logs will yield objective and comparable results indicative of 
the state of data quality control in those catalogs. 

Heading consistency can be evaluated in online public 
catalogs in two distinct contexts. Within a given catalog, 
whether the library's catalog records use the old or the 
revised form of a heading is arguably less important, pro- 
vided the same form is used in all cases. The criteria of 
consistency and complete retrieval can be met in both cases. 
However, when a catalog's access points are integrated with 
those of other libraries' catalogs in a union catalog or feder- 
ated searching environment, consistency within the local 
catalog may not be sufficient. The goal of consistent search 
results in union catalog contexts implies a shared commit- 
ment to using the latest form of an authorized heading, 
since that provides all contributing libraries with a common 
standard. For that reason, this study examines both these 
aspects of heading consistency: the rate at which heading 
"splits" (headings found in both old and new forms) occur 
in a single catalog; and the extent to which the new form is 
found to have replaced the old across a set of catalogs. By 
focusing on the data in the source records, interface vari- 
ability can be ignored in favor of measuring adherence to 
commonly held goals and data standards. 

Research Method 

Using the two identified sources, the LCSH "Weekly Lists" 
and the OCLC lists of "Closed Dates in Authority Records" 
for personal name headings, three sample sets of revised 
subject and name headings were compiled. The source 
lists are posted weekly, and the sample sets were drawn 
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from three separate starting points within each series, both 
to broaden the range of heading changes gathered and to 
reveal any changes over time in the updating of headings in 
the catalogs under review. The name sample sets were col- 
lected from lists spaced approximately six months apart. The 
subject samples were collected from lists spaced approxi- 
mately a year apart. In each case, the changed headings 
in the lists were reviewed from the chosen starting point 
forward in the list or lists until approximately fifty sample 
headings had been collected. For subjects, several lists were 
reviewed for each sample of fifty changed headings, since 
the number of changed subject headings in each LCSH 
weekly list is fairly small (approximately seven to nine). For 
names, each OCLC list examined exceeded the required 
number of fifty changed headings. With each new list 
sampled, the alphabetical end point of the previous sample 
set was used as the starting point for assembling the next set 
of fifty changed name headings. 

Once the lists were assembled, a set of target pub- 
lic catalogs was selected. The home for this study is the 
University of Minnesota, which belongs to the Committee 
on Institutional Cooperation (CIC), a group of twelve large 
research universities. 9 The CIC universities have a history 
of cooperation, including the maintenance of a federated 
search of CIC catalogs for a time, and are often used as 
peer institutions for comparison purposes. The thirteen 
catalogs of the CIC libraries (counting the library catalogs 
of the Chicago and Urbana-Champaign campuses of the 
University of Illinois as separate catalogs) were therefore 
selected to test the method being explored. Because this 
study is exploratory, the names of the catalogs studied have 
been randomly ordered and replaced by A, B, C, and so on, 
in the results. The LC's public catalog was also included 
in the set of target catalogs, since it is the source for many 
of the changed headings. Because the LC's results show a 
significantly greater use of the headings under review and 
therefore make its identity obvious, the LC's results have 
been labeled. 

All study data were gathered by searching the public 
catalogs of the target institutions. None of it depended on 
privileged access to information. The old and new forms of 
each of the sample headings were browse searched in each 
target public catalog, and the number of hits found for the 
old and new forms of the heading was recorded in a spread- 
sheet, the primary tool for data gathering and manipulation. 
Spreadsheet formulas were used to calculate the percentage 
of new headings found. In the spreadsheet, "0%" indicates 
that only instances of the old heading were found; "100%" 
indicates that only instances of the new heading were found; 
and any percentage in between indicates a split between the 
old and new forms. Where no use of either form was found, 
"NA" (not applicable) was substituted for a percentage. A 
sample of the spreadsheet appears in table 1. The project 



spreadsheets (without institution names) have been depos- 
ited in the University of Minnesota's institutional repository, 
where they are available for external review. 10 

For the subject samples, separate counts were made 
of unsubdivided and subdivided forms of the old and new 
heading. For the name samples, separate counts were 
collected under the old and new form for the name as an 
author and as a subject. These refinements to the counting 
were made to assess whether heading consistency in the 
catalogs studied differs for authorized heading strings (the 
unsubdivided subject strings) versus unauthorized heading 
strings (most subdivided subjects) and for personal name 
headings in author indexes versus personal name headings 
in subject indexes. In expressing the results of the study for 
this paper, the counts of unsubdivided and subdivided sub- 
ject heading forms have been merged to show a single count 
of old versus new main heading forms for the subject index 
as a whole. The results of the personal name searches are 
reported separately for name indexes and subject indexes. 

Some deselection of the headings initially found in the 
sampling process proved advisable. Several types of head- 
ings were excluded: 

1. Headings with old and new forms that would normalize 
and file identically, e.g., headings changed to remove a 
hyphen or to correct diacritics, capitalization, or tag- 
ging. These differences would be difficult to discern in 
index displays and, in any case, appear unlikely to affect 
access. 

2. Main headings appearing multiple times with different 
subdivisions or with different phrase extensions (e.g., 
". . . in art"). A few instances of this were retained in 
the study to explore whether the presence of an estab- 
lished heading-plus-subdivision string in the author- 
ity file accounted for a higher rate of correction. The 
minimal results from the few cases in the study sample 
suggest not, but are far from conclusive. 

3. Headings with more than two forms, e.g., those that 
changed again following the "new" form's establish- 
ment, or those that merged two earlier headings. The 
presence of multiple forms would require exceptional 
forms of counting. Given the relative rarity of instances 
like this, the few encountered were generally omitted. 
(An exception: the more heavily used older form "Hog 
cholera" was counted and the alternative "Swine fever" 
was not, though both were merged in the new heading 
"Classical swine fever.") 

4. Narrower subject headings merged into broader pre- 
existing ones, e.g., the formerly established "Middle 
Ages — History" being merged into "Middle Ages." 
Counting the number of changed headings under the 
preferred form would be impossible in most public 
catalogs. 
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5. Headings with an identical form in another heading 
system (e.g., the same term used by LCSH and MeSH) 
or MARC tag category (e.g., the same term used as 
a topical 650 and a genre/form 655 subject heading) 
also proved problematic. Some catalogs sort these 
differences into different indexes, but not all; if they 
appear in the same index, counting the instances of 
the changed bibliographic headings becomes difficult, 
requiring a record-by-record examination of MARC 
tag values. 

The need for sensitivity to these kinds of problems 
makes the compiling of the sample sets a task that requires 
a cataloger's expertise. However, once the list is compiled, 
the process of searching each form and counting hits can 



be learned quickly and requires few judgment calls. The 
method used here was further simplified by regarding 
only the established old and new heading forms on the list. 
Other forms were occasionally observed in the indexes (e.g., 
headings with typographical errors), which also caused split 
headings, but these were not counted. The only heading 
splits reported are between instances of the two forms on 
the sample list. 

The data were gathered between January 2 and 
February 8, 2008, roughly one year after the date of the 
most recent heading change list sampled. When this task 
was complete, summary counts were made for each of the 
catalogs, showing for each set how many of the sample 
headings were found to be all old form, all new form, split 
between the old and new form, or unused in each catalog. 



Table 1. Changed LC Subject Headings— Data Sample (Excerpts) for One Catalog 



Cancelled Term 



Baldwin Hills 
(Calif.) 

Breast feeding 

Breast feeding — 

Immunological 

aspects 

Breast feeding — 
Law and legislation 

Church of England. 
Book of common 
prayer. Psalter 

North Shore 
(Mass.) 

Unites States 
Highway 58 

Dargari language 

Reparation 

Aranda language 

Black humor 
(Literature) 



New Term 



Baldwin 
Hills (Calif.: 
Mountains) 

Breastfeeding 

Breastfeeding — 

Immunological 

aspects 

Breastfeeding — 
Law and 
legislation 

Church of 
England. Psalter 



North Shore 
(Mass.: Coast) 

United States 
Highway 58 (Va. 
and Tenn.) 

Tharrkari 
language 

Reparation 
(Criminal justice) 

Western Arrernte 
language 

Black humor 



List Year 
and 
No. 



2005.1 

2005.1 
2005.1 

2005.1 

2005.1 

2005.1 
2005.1 

2005.2 
2005.2 
2005.3 
2005.3 



No. Of 
Uses of 
Old LC 
Heading 

o 



With 
Subdivision 



No. of 
Uses of 
New LC 
Heading 





203 
1 



With 
Subdivision 



Percent 
Using 



Percent 
with New 



New LC Subdivision 
Heading 



323 
2 



703 



29 



NA 

100 
100 

100 

100 

NA 
100 

100 
100 

91 



NA 

100 
100 

91 

100 

10 
100 

100 
99 
50 
91 
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The summary for the sample data in table 1 is illustrated 
in table 2. 



Results 

The summary table data have been expressed as a series 
of doughnut graphs for quicker comprehension. Each 



Table 2. Changed LC Subject Headings— Data Summary for 
One Catalog 



2007 

All old headings (0%) 

All new heading (100%) 

Split headings (l%-99%) 

Unused headings (NA) 

Percent split of all used 

Percent used of all checked 
(n = 52) 

2006 

All old headings (0%) 

All new heading (100%) 

Split headings (l%-99%) 

Unused headings (NA) 

Percent split of all used 

Percent used of all checked 
(« = 46) 

2005 

All old headings (0%) 

All new heading (100%) 

Split headings (l%-99%) 

Unused headings (NA) 

Percent split of all used 

Percent used of all checked 
(n = 49) 



LC Headings 



Base Heading 
Only 




27 
6 
19 
18 
63 



1 

31 
10 

4 
24 
91 



2 

26 
5 
16 
15 
67 



Base + 
Subdivided 





34 
9 
9 
21 
83 



1 

29 
12 
4 
29 
91 



2 
31 
12 
4 
27 
92 



doughnut graph shows proportionally the four states of the 
sample set headings found in each catalog: all instances in 
the old form, all instances in the new form, instances split 
between old and new forms, and no instances in the catalog. 
The outermost ring represents the sample set of the earli- 
est changed headings, and the innermost ring represents 
the most recently changed sample (see figures 1-3). This 
arrangement of the data makes it easier to see patterns over 
time in the proportions of each institution's sample sets. 

Figure 1 represents the states of changed LC subject 
headings in the target catalogs. Some catalogs (LC, D, F, 
G, H, I, K) show most of their headings fully converted to 
the new form, while others (A, B, C, E) are largely uncon- 
verted or split for all three sample sets. Changed subjects 
also account for the largest proportions of heading splits 
overall when compared with the changed name headings. 
Catalogs J and L show the least use of the headings stud- 
ied, while catalogs K and F and the LC s catalog show the 
greatest use. 

Figure 2 represents changed LCNAF personal name 
headings in author indexes. Changed names show a slightly 



Library of Congress 

















KEY-Outer to inner rings show 2005, 2006, and 
2007 samples of changed LCSH headings 




□ All old headings (0%) 

■ All new headings (100%) 

□ Split headings (1%-99%) 

□ Unused headings (NA) 



Figure 1 . States of Changed LC Subject Headings in CIC Subject 
Indexes 
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KEY — Outer to inner rings show state 
of personal name headings changed 
in 200602, 200608, and 200702 

■ All old hdgs (0%) 
k ■ All new hdgs (100%) 
□ Split hdgs (1%-99%) 
□ Unused hdgs (NA) 



Library of Congress 

















KEY~Outer to inner rings show state 
of personal name headings changed 
in 2006/02, 2006/08, and 2007/02 in 
Subject indexes 



□ All old hdgs (0%) 

■ All new hdgs (100%) 

□ Split hdgs (1%-99%) 

□ Unused hdgs (NA) 



Figure 2. States of Changed LC Personal Name Headings in CIC 
Author Indexes 



Figure 3. States of Changed LC Personal Name Headings in CIC 
Subject Indexes 



higher rate of use across the CIC catalogs than did changed 
LCSH headings. Catalogs G, I, and K show large propor- 
tions of fully converted headings across all three samples, 
while catalogs C, E, and M are largely unconverted. The 
mixed results are seen for catalogs A, D, H, J, and L. Each 
shows one or two of die sample sets represented by each of 
the rings largely converted, but not all three. Such mixed 
results are more common for changed names than for sub- 
jects, suggesting that different approaches are being taken 
for these different kinds of maintenance. 

Figure 3 represents changed LCNAF personal name 
headings in subject indexes. Not surprisingly, this set of 
results shows the lowest use rates, though even here use 
of less than a quarter of the sample headings is relatively 
uncommon. Catalogs G, I, and K again stand out as the 
most fully converted, while catalogs B, C, and E are largely 
unconverted and A, D, and H show mixed results. 

This analysis does not take into account the number of 
hits found for each heading in each catalog. It looks only 
at whether the heading is present, whether all matches 
are on the old or the new form, and whether they are split 
between the two forms. The raw data, however, do include 



hit counts for each form of the heading and could support 
other kinds of analysis. For example, splits can only occur 
when a heading appears more than once in a catalog. Single 
appearances ranged from two to ten per catalog in the name 
samples, averaging five to six, and were always outnumbered 
by headings appearing multiple times. This indicates that a 
low count for splits in a particular catalog cannot simply be 
attributed to single appearance headings that could not be 
split. 



Discussion 

An examination of the data gathered prompts diree kinds of 
analysis. The first considers the extent to which the sampled 
headings were found to be in use in the target catalogs and 
whether the method chosen to assemble the sample sets of 
headings was effective. The second responds to the ques- 
tion that prompted this project — can a study of the state of 
changed headings in library catalogs provide a useful mea- 
sure of data quality for those catalogs? The last takes up two 
more general questions — are the true states of headings in 
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our catalogs sufficiently consistent with common models of 
data quality; and are there changes to the systems involved 
in revising headings that could simplify the task of heading 
maintenance? 

Headings 

The sample headings used in this study are listed in appen- 
dixes A and B. Each list shows both of the forms searched 
for each heading, the list from which they were drawn, and 
the number of catalogs out of the thirteen CIC catalogs 
surveyed in which the heading was found. The entries in 
the list have been sorted in order of their frequency of 
occurrence. 

One question being explored by this study was whether 
the sample-generating method used would yield headings 
that produced useful data. Many name and subject head- 
ings occur only infrequently even in large catalogs, and 
selecting headings without regard for their narrowness or 
obscurity might have resulted in very few hits. Looking 
only at the CIC catalogs, twenty-one out of the 147 subject 
heading pairs checked were unused. At the same time, 
thirty-six were found in all thirteen CIC catalogs, including 
twelve from each of the three sample lists. Subject head- 
ings appearing in at least half of the target catalogs account 
for 50 percent of the full subject sample list. In the case of 
name headings, fifteen out of 155 name pairs sampled were 
unused, while fifty were found in all thirteen CIC catalogs, 
including at least twelve in each sample set. Name pairs 
used in at least half of the CIC catalogs accounted for 68 
percent of the name sample as a whole. The investigator's 
ability to discern which name headings would have no or 
few hits was minimal. Subject headings proved a bit more 
predictable. Headings for geographic features had very low 
hit counts, as did headings reflecting narrow ethnicities or 
nationalities and headings for uncommon species. 

An alternative approach to selecting the target head- 
ings would be to have a threshold count in one or more test 
catalogs that each candidate heading would have to meet. 
This would provide a larger volume of data for determin- 
ing the use of old versus new forms in the target catalogs. 
Nevertheless, the samples used have generated sufficient 
evidence to provide useful data for this exploratory study. 
Furthermore, they demonstrate that any large catalog com- 
parable to those studied is likely to have a number of head- 
ing changes to attend to in every sequence of fifty changed 
headings appearing in these Web-distributed lists. The 
heading changes being promulgated in these lists are hav- 
ing a constant effect on the currency of headings in library 
catalogs. In the case of the most widely found headings, the 
effect is often also significant, with some heading changes 
affecting hundreds of bibliographic records. 



Catalogs 

The data gathered by the study and represented in the 
graphs in figures 1-3 indicate each catalog's performance 
against the measures being explored, heading currency and 
heading splits. Catalogs G, I, and K show a high propor- 
tion of consistently used new heading forms across all the 
sample sets. Catalogs B, C, E, and M show a predominance 
of consistent use of older forms. In some cases, the message 
is mixed; e.g., Catalogs A and D show a high proportion of 
consistently new headings in the oldest name sample, while 
consistently old forms still predominate in the more recent 
sample sets. Larger proportions of split headings were 
found in those catalogs with larger proportions of old head- 
ing forms — Catalogs A, B, C, and E. This indicates that the 
greatest reductions in split headings are achieved in catalogs 
that also show the greatest success in updating headings to 
their newer forms. 

The purpose of this exploratory study was primarily 
to develop a method that can demonstrate significant dif- 
ferences between catalogs and thereby provide a useful 
measure of performance. The study was not designed to 
explain these differences. However, a number of factors can 
be suggested. 

Some automated systems provide more efficient func- 
tionality than others for automatically updating authorized 
headings; however, no system in use at more than one CIC 
institution was found to correlate consistently with more 
current or more consistent headings. The extent to which 
any library is able to exploit its system's helpful features can 
vary depending on the availability of staff time and expertise 
and the press of other significant priorities. The inclusion in 
a library's system of current authority records and access to a 
vendor's authority processing service also might be factors in 
explaining the differences found between catalogs. 

Many catalogs are subject to influxes of older or oth- 
erwise problematic catalog records, e.g., when the records 
for a microfilm set or a retrospective conversion project 
are batchloaded or when records from a foreign vendor are 
loaded for acquisitions purposes. The fact that the present 
survey was carried out over a limited period of time may 
have meant that some catalogs were reviewed at a "bad 
time." Bepeating the data gathering exercise for the sample 
headings used here at a later date to determine how the pro- 
portions of new, old, and split headings might have shifted 
would be an interesting exercise; though once this paper is 
published, it may itself have an effect on the state of this 
particular set of sample headings. Bepeating the study with 
a new sample set of changed headings could amplify or cor- 
rect impressions left by the current study. 

In any case, the data from this study do support the 
notion that maintaining current, authorized, consistent 
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headings in the library catalog is an achievable goal. None 
of the catalogs performed perfectly in this regard, but per- 
fection is a Procrustean standard of measurement. The new 
appearance of old headings and of heading splits is a con- 
stant in catalog management, and can never be eliminated. 
Realistically, the goal should be to keep heading currency 
high and heading splits within tolerable limits, as deter- 
mined first by each library's policies and goals and then by 
an awareness of what peer institutions are achieving. The 
study results support this kind of comparative judgment and 
goal setting by revealing differences between catalogs and 
illustrating the relative success of some catalogs — here cata- 
logs G, I, and K — in responding to the challenge of main- 
taining current and consistent headings. Catalog data quality 
should not be taken for granted. The variation observable 
across catalogs in a performance measure — heading cur- 
rency and consistency — which is essential to interoperability 
and uniform search results, highlights both the need for 
greater effort and realistic benchmarks for success. 

Systems 

The term "systems" here refers broadly to the complex of 
rules, technology, and practice that governs the manage- 
ment of catalog headings. It does not refer simply or even 
primarily to integrated library systems. 

Many heading changes do not involve any change in 
the definition of the entity or concept named. The name 
headings "Abbey, Edward, 1927-" and "Abbey, Edward, 
1927-1989" represent the same entity, with or without a 
death date. "Breast feeding" and "Breastfeeding" represent 
the same concept, with or without the space. As long as 
what the authority record names does not change, changes 
to the name itself are easily managed, at least in principle. 
However, rules limiting the types of references permitted on 
LC authority records have made this situation more com- 
plex. LC policy currently does not allow the retention of the 
older, open-dated personal name heading form as a reference 
on LCNAF records when a death date is added. Similarly, 
no reference from an earlier LCSH subject heading form is 
allowed in some cases when LCSAF headings are updated. 
In January 2007 the MARC Advisory Committee and ALAs 
MARBI (Machine-Readable Bibliographic Information) 
Committee passed Proposal No. 2007-02, which intro- 
duces new coding to enable the inclusion of these kinds 
of references. 11 The proposal was approved in May 2007 
by the LC, Library and Archives Canada, and the British 
Library, though no implementation plans or dates have been 
announced (as of October 15, 2008). Implementing the 
proposed changes could lead to simpler and more standard- 
ized online system functions for automated maintenance of 
changed headings. 



In other cases the constancy of the definition of the 
changed heading is more problematic. Subject headings for 
open-ended periods in a country's history may have been 
used on bibliographic records for events that fall beyond a 
later-assigned closing date for the period. When the head- 
ing change is not one-for-one, each instance of the older 
form needs to be evaluated to ensure a correct revision. 
In many of the catalogs studied, including the LC's, clos- 
ing historical period headings — e.g., changing "Cuba — 
History— 1959-" to either "Cuba— History— 1959-1990" 
or "Cuba — History — 1990-" — lagged behind other types of 
subject heading maintenance. These kinds of changes will 
resist automated solutions and account in part for the larger 
number of split headings found for changed subject head- 
ings than for changed name headings. Developing more 
automated means for managing routine one-for-one head- 
ing changes would enable more staff time to be focused on 
those changes that require intellectual decisions. 

Lastly, the inefficiencies inherent in maintaining cata- 
log headings across multiple distributed catalogs could 
be addressed at the systems level. The more libraries can 
share a single record for bibliographic access, the fewer the 
records that will need to be maintained. This potential for 
increased efficiency is one of the motivations behind the 
current interest in OCLC's development of WorldCat Local, 
a catalog model that filters widely shared and maintained 
OCLC bibliographic records against each record's holding 
institutions to provide distributed access to local collections 
from a centralized database. If sufficient functionality can 
be built into this model to make it competitive with more 
conventional library systems, its advantages in terms of 
shared data management could be significant. 

Conclusion 

The method tested in this study bears out the hypothesis 
that examining headings across library catalogs for currency 
and consistency can produce quantified, comparable results, 
and can serve as one useful measure of catalog data quality. 
A study of this kind can indicate how well a library's catalog 
is performing in relation to locally established goals and to 
the catalogs of peer institutions, and it can indicate areas 
needing greater attention. The results also indicate that 
heading changes in the LCNAF and LCSAF are having 
negative as well as positive effects on catalog performance. 
Maintaining catalog headings is a constant challenge, and 
not one that is being universally or consistently met. 

The proposed method for data gathering and analysis 
could be improved upon in several ways. Better methods 
might be devised for selecting sample heading sets to 
reduce the number of unposted or rarely posted headings. 
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A more nuanced analytical approach might factor in the 
number of bibliographic records containing new versus old 
headings found in each catalog to produce a more balanced 
accounting of heading currency. Other sources of heading 
variations — e.g., typographical errors or unexplained variant 
forms — could be included to give a more complete measure 
of the occurrence of heading splits in the target catalogs. 

More research into explanatory factors could also prove 
valuable. Are there common elements in the technical ser- 
vices operations or system implementations of those librar- 
ies that do well on this measure? Do apparent patterns in 
the heading currency of particular catalogs reflect changes 
in policies or procedures and their effect? Would closer 
attention to the types of changed headings that do and do 
not receive prompt maintenance attention suggest alterna- 
tive ways of distributing this work? 

Lastly, the fundamental question behind this study 
remains unanswered — should the library catalog's data 
quality be evaluated as an outcome measure of the work of 
technical services? This study has attempted to demonstrate 
a practical method for such measurement, but it cannot 
answer the question of whether such measurement should 
be undertaken by a library or a group of libraries or included 
in models of library evaluation. Further discussion of that 
question would also be enlightening. 
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Appendix A. Changed LC Subject Headings, Sorted by Frequency of Use in CIC Catalogs 



Cancelled Term 


New Term 


List No. 


Catalogs 
Headin 


Angels (Judaism) 


Angels — Judaism 


2005.3 


13 


Black humor (Literature) 


Black humor 


2005.3 


13 


Breast feeding 


Breastfeeding 


2005.1 


13 


Breast feeding — Immunological aspects 


Breastfeeding — Immunological aspects 


2005.1 


13 


Calligraphy, Islamic 


Islamic calligraphy 


2005.5 


13 


Calligraphy, Zen 


Zen calligraphy 


2005.5 


13 


Children's web sites 


Web sites for children 


2007.1 


13 


China — Social life and customs — 1976— 


China — Social life and customs — 1976-2002 


2006.2 


13 


Crimes against humanity, German 


Crimes against humanity — Germany 


2007.4 


13 


Cross Florida Barge Canal (Fla.) 


Marjorie Harris Carr Cross Florida Greenway (Fla.) 


2007.5 


13 


Cuba — Foreign relations — 1959- 


Cuba— Foreign relations— 1959-1990 


2006.3 


13 


Cuba— History— 1959- 


Cuba— History— 1959-1990 


2006.3 


13 


Cuba — Politics and government — 1959- 


Cuba — Politics and government — 1959-1990 


2006.3 


13 


Definition (Logic) 


Definition (Philosophy) 


2007.4 


13 


Friendly societies 


Fraternal organizations 


2006.7 


13 


Friendly societies — United States 


Fraternal organizations — United States 


2006.7 


13 


Gutters 


Roof gutters 


2005.4 


13 


Hog cholera — Vaccination 


Classical swine fever — Vaccination 


2007.9 


13 


Hog cholera 


Classical swine fever 


2007.9 


13 


Indians of North America— Wars, 1868-1869 


Washita Campaign, 1868-1869 


2005.8 


13 


Insurance, Unemployment — Claimants 


Unemployment insurance claimants 


2007.2 


13 


Islam and terrorism 


Terrorism — Religious aspects — Islam 


2005.2 


13 


Kennebec Patent 


Kennebec Patent (Me.) 


2007.9 


13 


Knizhnik-Zamoldchikov equations 


Knizhnik-Zamolodchikov equations 


2006.3 


13 


Lady and the Unicorn (Tapestries) 


Lady and the Unicorn 


2005.4 


13 


Mexico — Politics and government — 1988- 


Mexico — Politics and government — 1988-2000 


2007.6 


13 


Online data processing — Downloading 


Downloading of data 


2006.1 


13 


Path analysis 


Path analysis (Statistics) 


2006.4 


13 


Puerto Rico— History— 1952- 


Puerto Rico— History— 1952-1998 


2007.2 


13 


Reciprocity 


Reciprocity (Commerce) 


2007.8 


13 


Reparation 


Reparation (Criminal justice) 


2005.2 


13 


South Africa — History — 1961- 


South Africa— History— 1961-1994 


2006.7 


13 


Student loan funds 


Student loans 


2006.1 


13 


Swastika 


Swastikas 


2005.2 


13 


Wages — Gas industry employees 


Wages — Gas industry 


2007.1 


13 


Weblogs 


Blogs 


2006.8 


13 


Ak Koyunlus (Turkic people) 


Ak Koyunlu (Turkic people) 


2006.5 


12 


Art, Papua New Guinea 


Art, Papua New Guinean 


2006.8 


12 


Church of England. Book of common prayer. Psalter 


Church of England. Psalter 


2005.1 


12 


Malawi — History — 1 964- 


Malawi— History— 1964-1994 


2007.1 


12 


New Zealand — History — Maori War, 1845-1847 


New Zealand— History— New Zealand War, 1843-1847 


2005.2 


12 


Papua New Guinea literature (English) 


Papua New Guinean literature (English) 


2006.8 


12 


Usury (Islamic law) 


Interest (Islamic law) 


2005.5 


12 


Amendments (Parliamentary practice) 


Legislative amendments 


2005.8 


11 


Friendly societies — Law and legislation 


Fraternal organizations — Law and legislation 


2006.7 


11 


Harmonica and electronic music 


Electronic and harmonica music 


2005.5 


11 


Karen language 


Karen languages 


2007.9 


11 


Lullabies, American 


Lullabies, English — United States 


2006.1 


11 


Pallavas 


Pallava dynasty, 4th-9th centuries 


2007.9 


11 



36 Hearn 



LRTS 53(1) 



Cancelled Term 


New Term 


List No. 


Catalogs 

ncuun 


loUla cum Midi I11UMC 


Dllcll clllU. LclUlcl 11 111 ML- 


2007.8 


I J 


TTttflr TClianH Rpcnnn ^TnHia^ 

U LLtll IVlltlllU ±Vt,£^lWll \_±nuici_/ 


T Tttfirfinrliftl fTnHifi^ 

\J LUXl allCllal ^lllUld J 


2005.2 


11 


/^UCldlll IJTclUlitl 1NCW \J LllllCcl UCUlflC J 


rt.UCldl.ll IJTtllJLUl 1>CW vl LllllCclll uCUUlCI 


9006 8 


10 


A If aHprnicliPcl^ii m^lii Hranmtiplipclfii tp>^tr /"^aint 
/^JvclUCllllCllCSR-ll lllalll Ul cllllelllCllCMVll LCall lOclllil 


A f\ f^TTi ir*ni :i cl/"ii m 1 rlrciTn ntir'rifcl^ii 
rt.lvclUCllllLllCaR.ll lllcllVl lllcllllclllCllChlvll LCtlll 


2005 4 


10 


Petersburg, Russia) 


(Saint Petersburg, Russia) 






Angels (Islam) 


Angels — Islam 


2005.3 


10 


Canons, fugues, etc. (Voice) 


Canons, fugues, etc. (Voices) 


2006.5 


10 


Electronic and harpsichord music 


Harpsichord and electronic music 


2005.5 


10 


Online data processing — Uploading 


Uploading of data 


2006.4 


10 


Siddhi (Indie people) 


Siddi (Indie people) 


2007.4 


10 


Yay language 


Bouyei language 


2007.2 


10 


Anostraca 


Fairy shrimps 


2007.1 


9 


Aranda language 


Western Arrernte language 


2005.3 


9 


Banda language 


Banda language (Central Africa) 


2007.9 


9 


Breast feeding — Law and legislation 


Breastfeeding — Law and legislation 


2005.1 


9 


Congridae 


Conger eels 


2006.4 


9 


Creeper lanes 


Climbing lanes 


2007.1 


9 


Midea Site (Greece) 


Midea (Extinct city) 


2005.4 


9 


Dargari language 


Tharrkari language 


2005.2 


8 


Karakoyunlus 


Kara Koyunlu (Turkic people) 


2006.5 


8 


Mesaras Plain (Greece) 


Mesara Plain (Greece) 


2005.2 


8 


Papua New Guinea fiction (English) 


Papua New Guinean fiction (English) 


2006.8 


8 


Saint Martin — Description and travel 


Saint Martin (West Indies) — Description and travel 


2006.3 


8 


Sculpture, Kota 


Sculpture, Kota (Africa) 


2006.4 


8 


Dolgan dialect 


Dolgan language 


2007.8 


7 


Karts (Midget cars) 


Karts (Automobiles) 


2006.5 


7 


Daba language 


Daba language (Cameroon and Nigeria) 


2007.4 


6 


Little League World Series, Williamsport, Pa. 


Little League World Series (Baseball) 


2005.4 


6 


Marriage (Luo law) 


Marriage (Luo (Kenya and Tanzania) law) 


2007.2 


6 


North Shore (Mass.) 


North Shore (Mass. : Coast) 


2005.1 


6 


Art, Parsic 


Parsee art 


2007.7 


5 


Back River (TMunavut) 


Back River (N.W.T. and Nunavut) 


2005.5 


5 


Back River Valley (Nunavut) 


Back River Valley (N.W.T. and Nunavut) 


2005.5 


5 


Caernarvon Castle (Caernarvon, Wales) 


Caernarfon Castle (Caernarfon, Wales) 


2006.4 


5 


Calligraphy, Buddhist 


Buddhist calligraphy 


2005.5 


4 


Calligraphy, Islamic, in art 


Islamic calligraphy in art 


2005.5 


4 


Calligraphy, Taoist 


Taoist calligraphy 


2005.5 


4 


Castel Roncolo (Bolzano, Italy) 


Castel Roncolo (Bolzano, Trentino-Alto Adige, Italy) 


2006.8 


4 


Cookery, Papua New Guinea 


Cookery, Papua New Guinean 


2006.8 


4 


Hog cholera — Diagnosis 


Classical swine fever — Diagnosis 


2007.9 


4 


Mass media in breast feeding promotion 


Mass media in breastfeeding promotion 


2005.1 


4 


Alfures (New Guinea people) 


Alfures (New Guinean people) 


2006.8 


3 


Emirian periodicals 


Emirati periodicals 


2007.3 


3 


Frake family 


Frakes family 


2007.7 


3 


Papua New Guinea drama (English) 


Papua New Guinean drama (English) 


2006.8 


3 


Pochard 


Common pochard 


2005.4 


3 


Public interest (Islamic law) 


Istislah (Islamic law) 


2006.8 


3 


Rajbangsi dialect 


Rajbangsi language 


2007.8 


3 


Sgaw Karen dialect 


Sgaw Karen language 


2007.9 


3 


Angels (Buddhism) 


Angels — Buddhism 


2005.3 


2 


Croatia — History — Zrinski-Francopan Conspiracy, 


Zrinski-Francopan Conspiracy, Croatia, 1664-1671 


2007.7 


2 



1664-1671 
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Cancelled Term 

Gabbard family 
Kayan language 
Kayu Agung dialect 
Kittiwake 

Liang Mountains (China) 

Polish American friendly societies 

Pwo Karen dialect 

Saint Martin — Antiquities 

Versatile Manufacturing Ltd. Strike, Winnipeg, Man., 

2000-2001 
Victor (Jet planes) 
Arnica (Drug) 

Fertu-Hansag Nemzeti Park (Hungary) 

Flower pots in art 

Hu Mountain (China) 

Hypsiglena ochrorhynchus 

Immoral contracts (Islamic law) 

Jardins du Prieure de Salagon (Mane, France) 

Kornelsen family 

Lewis and Clark Cavern State Park (Mont.) 

Mythology, Kota 

Pahute Mesa (Nevada) 

Ricinodendron rautanenii 

Samo language 

Satluj River (India) 

Scottish American friendly societies 

Spruce Island (Alaska) 

Taungthu dialect 

Alamblak (Papua New Guinea people) 
Ansitz Rottenbuch (Bolzano, Italy) 
Atlases, Emirian 

Bagore-ki-haveli (Udayapura, India) 
Baldwin Hills (Calif.) 
Bankudu-Balue language 
Congrina 

Economic assistance, Emirian 

English language — Augment 

Hare Island (Ireland) 

Lakhra Coal Field (Pakistan) 

Lullabies, Puerto Rican 

Lycopersicon pimpinellifolium 

Mist Gas Field (Oregon) 

Pangaimotu Island (Vava'u Group, Tonga) 

Peng Chau (China) 

Proverbs, Yay 

Sonda Coal Field (Pakistan) 
Songs, Gaviao 
Sylarna (Sweden) 
United States Highway 58 



New Term List No. 

Gebhardt family 2007.6 

Kayan language (Borneo) 2007.9 

Kayu Agung language 2007.9 

Kittiwakes 2006.6 

Liang Mountains (Shandong Sheng, China) 2007.8 

Polish American fraternal organizations 2006.7 

Pwo Karen language 2007.9 

Saint Martin (West Indies) — Antiquities 2006.3 
Buhler Versatile Inc. Strike, Winnipeg, Man., 2000-2001 2007.5 

Victor (Jet bomber) 2007.5 

Arnica montana — Therapeutic use 2007.4 

Ferto-Hansag Nemzeti Park (Hungary) 2007.9 

Flowerpots in art 2006.3 

Hu Mountain (Jiangsu Sheng, China) 2005.5 

Spotted night snake 2006.4 

Illegal contracts (Islamic law) 2005.5 

Jardins du Prieure de Salagon (Mane, Provence-Alpes- 2006.8 

Cote d'Azur, France) 

Cornelsen family 2007.9 

Lewis and Clark Caverns State Park (Mont.) 2005.6 

Mythology, Kota (Africa) 2006.4 

Pahute Mesa (Nev.) 2005.6 

Manketti 2005.6 

Samo language (Western Province, Papua New Guinea) 2007.4 

Sutlej River 2007.5 

Scottish American fraternal organizations 2006.7 

Spruce Island (Kodiak Island Borough, Alaska) 2005.2 

Taungthu language 2007.9 

Alamblak (Papua New Guinean people) 2006.8 
Ansitz Rottenbuch (Bolzano, Trentino-Alto Adige, Italy) 2006.8 

Atlases, Emirati 2007.3 

Bagore-ki-Haveli (Udaipur, Rajasthan, India) 2005.5 

Baldwin Hills (Calif. : Mountains) 2005. 1 

Bakundu-Balue language 2007.3 

Bathycongrus 2006.4 

Economic assistance, Emirati 2007.3 

English language — Augmentatives 2005.2 

Hare Island (Cork, Ireland) 2007.6 

Lakhra Coalfield (Pakistan) 2006. 1 

Lullabies, Spanish — Puerto Rico 2006. 1 

Currant tomato 2007.7 

Mist Gas Field (Or.) 2005.6 

Pangaimotu Island (Vava'u, Tonga) 2007.4 

Peng Chau (Islands District, China) 2005.3 

Proverbs, Bouyei 2007.2 

Sonda Coalfield (Pakistan) 2006.1 

Songs, Gaviao (Para, Brazil) 2007.7 

Sylarna (Sweden : Mountain) 2005.4 

United States Highway 5 8 (Va. and Term.) 2005.1 



Catalogs Using 
Headings 

2 
2 
2 
2 
2 
2 
2 
2 
2 
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Appendix B. Changed LC Name Headings, Sorted by Frequency of Use in CIC Catalogs 



Cancelled Name 


New Name 


OCLC List 
Date 


Catalogs Using 
Heading 


All J 11 /\ A "T 

Abbey, Edward, 1927— 


All T^J J 1 AA*7 1 AAA 

Abbey, Edward, 1927-1989 


AAA/ A1 AA 

20060208 


13 


Adams, Norman Isley, 1 895- 


Adams, Norman Isley, 1895-1985 


20060208 


13 


Adler, Mortimer Jerome, 1902— 


Adler, Mortimer Jerome, 1 902-200 1 


20060208 


13 


Albert, Stewart Edward, 1939- 


All j Ci* a. 1 ? J J 1 AO A 1AA/ 

Albert, Stewart Edward, 1939-2006 


1AA/ATAO 

20060208 


13 


All " 1 O A ~7 

Alley, Rewi, 1897- 


All 11 "1 OA*7 1 AAT 

Alley, Rewi, 1897-1987 


A A A / A A A O 

20060208 


13 


a £" i. ~\7~ ! i aaa 

Ararat, Yasir, 1929- 


A -C j- T T • 1 A A A A A A A 

Aralat, Yasir, 1929-2004 


AAA/"AAAO 

20060208 


13 


A 1 J T 1 * lAI^ 

Axelrod, Julius, 1912- 


A _1 J T !• 1A1A 1AA/I 

Axelrod, Julius, 1912-2004 


AAA/"AAAO 

20060208 


13 


[ -| 11 T -11 1 A 1 1 

Ball, Lucille, 1911- 


TA 11 T "11 1A11 1 nil A 

Ball, Lucille, 1911-1989 


A A A / A A A O 

20060208 


13 


Bancroft, Anne, 1931- 


Bancroft, Anne, 1931-2005 


A A A / A A A O 

20060208 


13 


Bethe, Hans Albrecht, 1906- 


Bethe, Hans Albrecht, 1906-2005 


20060208 


13 


taI 1 tt 1 j i r\AA 

Block, Herbert, 1909- 


TA 1 1 TT 1 _a 1AAA A A A 1 

Block, Herbert, 1909-2001 


AAA/AAAA 

20060208 


13 


Bolitho, Hector, 1 898— 


Bolitho, Hector, 1898-1974 


AAA/AAAA 

20060208 


13 


Brown, Herbert Charles, 1912- 


TA „ „ , . r „ T T ^ /AT, ~ „1 „ „ | ni 1 AAA/1 

brown, Herbert Charles, 1912— 2004 


20060208 


13 


Burns, George, 1896- 


Burns, George, 1896-1996 


AAA/AAA1 

20060201 


13 


/—\ ii -i t 4 r\ i r\ 

Callaghan, James, 1912- 


s~\ 11 1 T lAIAAAAr" 

Callaghan, James, 1912-2005 


AAA/AAAA 

20060208 


13 


Carson, Johnny, 192:>— 


Carson, Johnny, IVzd— 2005 


TAO^m AO 

20060208 


13 


Cheney, Brainard, lyOO— 


/" ' 1 , „ ta ' J 1AAA 1 AAA 

Cheney, Brainard, 1900—1990 


i AA«cnn AO 
20060208 


13 


Cheney, trances Neel, 1906- 


. T7„„„ „ „ \ T 1 1 AA/ 1 AA£ 

Cheney, Frances Neel, 1906-1996 


A AA/ AA AO 

20060208 


13 


"i i ni "1 i aa a 

Chisholm, Shirley, 1924- 


Chisholm, Shirley, 1924-2005 


AAA/AAAA 

20060208 


13 


Clark, Kenneth Bancroft, 1914— 


/^1 1 T r j 1 r -» j-v ini a AAA^ 

Clark, Kenneth Bancroit, 1914-2005 


AAA/AAAA 

20060208 


13 


Cochran, Johnnie L., 1937- 


Cochran, Johnnie L., 1937-2005 


20060208 


13 


I - 1 i /~ii i i /\ /-» o 

Ford, Charles, 1908- 


"I - * J /~11 1 1 AAO 1 AHA 

Ford, Charles, 1908-1989 


AAATAAAT 

20070207 


13 


TT J 1 t— ' i AAA 

Hadamowsky, Franz, 1900- 


TT J IT -1 1 AAA 1 AA^ 

Hadamowsky, Franz, 1900-1995 


20070207 


13 


Hayter, Stanley William, 1901— 


tt -j- o+ i , . it r;ii : i a a i i ao o 

Hayter, Stanley William, lyUl— 19o8 


A A A "7 A A A"7 

20070207 


13 


Hogben, Lancelot Thomas, 1895- 


Hogben, Lancelot Thomas, 1895-1975 


A A AT A A A~T 

20070207 


13 


TT ~Ti 1 T - 1 /r\ 1 T-il 1\ 1 /-\ r\ ~> 

Howes, Raymond F. (Raymond Floyd), 1903- 


TT "A J T 1 /T\ J T^l J\ 1 AA1 1 AO / 

Howes, Raymond F. (Raymond Floyd), 1903-1986 


AAATAAAT 

20070207 


13 


T T T TT 7 /T T - TT J 1 J \ 1 A 1 1 

Janson, H. W. (Horst Woldemar), 1913- 


T TTTT7/TT , .T f 11 \ 1 A i 1 1 AHA 

Janson, H. W. (Horst Woldemar), 1913-1982 


AAA*7AAAT 

20070207 


13 


TV" _ „ ^ J . , T„1~« T7 / T ~ 1, „ T - "; j.^, 1 J\ 1 A / A 

Kennedy, John r. (John ritzgerald), 1960— 


T/~ J.. T„1 T7 /T„l^ T7J a^ „ 1 J\ 1 A^A 1 AAA 

Kennedy, John t. (John ritzgerald), 196U-199y 


AAA/T AAA 1 
20060201 


13 


\/ _ < i . TTr;ii" „ a * 1A1A 

Kunstler, William Moses, 1919- 


\/ _^ ^ ■ 1 ^ _ TT7"11"„_ Ti K fW Tl Wl ft K \ 1A1A \ C\C\C 

Kunstler, William M. (William Moses), 1919-1995 


AAA*7AAAT 

20070207 


13 


La Sale, Antoine de, b. 1388.' 


T „ CI ^ A ^(-^™ ^ A~ nod 1 A £ 1 1 

La Sale, Antoine de, 1385 .'-1461 .' 


20060201 


13 


T J 1 A 1 A A ^ 

Landeck, Armin, 1905- 


T J 1 A " lAA^IAA^I 

Landeck, Armin, 1905-1984 


20070207 


13 


Lavon, Pinhas, 1904- 


T ~A"1 1 A A /I 1 AT/ 

Lavon, Pinhas, 1904-1976 


20070207 


13 


Ley, Hermann, 1911- 


T TT 1 A1 1 1 AAA 

Ley, Hermann, 1911-1990 


AAA /AAA 1 

20060201 


13 


Lowry, William R, 1927- 


t TTT'ii ■ n /ii7'ir "a j_a\ iA^n iaao 

Lowry, William P. (William Prescott), 1927-1998 


20070207 


13 


Mantle, Mickey, 1931- 


Mantle, Mickey, 1931-1995 


20070207 


13 


~K If ' T "J 1 O A / 

Massine, Leonide, 1896- 


H K ' T "J 1 O A X 1 O ~T A 

Massine, Leonide, 1896-1979 


AAA*7AAAT 

20070207 


13 


Moore, Barrington, 1913- 


Moore, Barrington, 1913-2005 


AAA/AOI /" 

20060816 


13 


T» T " Ti " 1 J W Jf /"A " 1 J "k SI 11 \ 

Nixon, Richard M. (Richard Milhous), 1913- 


XT' TTJ ' 1 111 /Tl " 1 J 1 *'11 „\ | AIO 1AA/I 

Nixon, Richard M. (Richard Milhous), 1913-1994 


AAA /AAA 1 

20060201 


13 


Obote, A. Milton (Apollo Milton), 1924- 


Obote, A. Milton (Apollo Milton), 1924-2005 


20060816 


13 


s~\ • T 1- -rr 1 1 ATA 

Onassis, Jacqueline Kennedy, 1929- 


S~\ T 1 " TV~ 1 1AAA 1 A A /I 

Onassis, Jacqueline Kennedy, 1929-1994 


AAA /AAA 1 

20060201 


13 


Osthoff, Helmuth, 1896— 


Osthoff, Helmuth, 1896-1983 


20060816 


13 


Peck, M. Scott (Morgan Scott), 1936- 


Peck, M. Scott (Morgan Scott), 1936-2005 


20060816 


13 


Redfield, William, 1927- 


Redfield, William, 1927-1976 


20060816 


13 


Rice, William, 1931- 


Rice, William, 1931-2006 


20060816 


13 


Rufer, Josef, 1893- 


Rufer, Josef, 1893-1985 


20060816 


13 


Sale, William Merritt, 1899- 


Sale, William Merritt, 1899-1981 


20060816 


13 


Salgado, Plinio, 1895- 


Salgado, Plinio, 1895-1975 


20060816 


13 


Salisbury, Harrison Evans, 1 908- 


Salisbury, Harrison E. (Harrison Evans), 1908-1993 


20060816 


13 


Tikhonov, Nikolai Semenovich, 1 896- 


Tikhonov, Nikolai Semenovich, 1896-1979 


20060816 


13 


Van Allen, James Alfred, 1914— 


Van Allen, James A. (James Alfred), 1914-2006 


20060816 


13 


Achelis, Elisabeth, 1880- 


Achelis, Elisabeth, 1880-1973 


20060208 


12 
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Cancelled Name New Name OC n LC . List Catalogs Using 

Dare Heading 

Benenson, Peter, 1921- Benenson, Peter, 1921-2005 20060208 12 

Bernhard Leopold, Prince, consort of Juliana, Bernhard Leopold, Prince, consort of Juliana, 20060208 12 

Queen of the Netherlands, 1911- Queen of the Netherlands, 1911-2004 

Bronfenbrenner, Urie, 1917- Bronfenbrerrner, Urie, 1 9 1 7-2005 20060208 12 

Erskine, Ralph, 1914- Erskine, Ralph, 1914-2005 20070207 12 

Harley, Robison D. 1911- Harley, Robison D. 1911-2007 20070207 12 

Hoffmeister, Adolf, 1902- Hoffmeister, Adolf, 1902-1973 20070207 12 

Hojo, Hideji, 1902- Hojo, Hideji, 1902-1996 20070207 12 

Luzzati, Emanuele, 1921- Luzzati, Emanuele, 1921-2007 20070207 12 

Machel, Samora, 1933- Machel, Samora, 1933-1986 20070207 12 

Malcolm, George John, 1917- Malcolm, George, 1917-1997 20070207 12 

Meyer, Ernst Hermann, 1905- Meyer, Ernst Hermann, 1905-1988 20060816 12 

Nelson, Gene, 1920- Nelson, Gene, 1920-1996 20060816 12 

Porter, Charles Orlando, 1919- Porter, Charles Orlando, 1919-2006 20060816 12 

Schmid, Daniel, 1941- Schmid, Daniel, 1941-2006 20060816 12 

Seel, Pierre, 1923- Seel, Pierre, 1923-2005 20060816 12 

Simpson, Robert Wilfred Levick, 1921- Simpson, Robert, 1921-1997 20060816 12 

Stein, Fritz Wilhelm, 1879- Stein, Fritz Wilhelm, 1879-1961 20060816 12 

Edmunds, Murrell, 1898- Edmunds, Murrell, 1898-1981 20070207 11 

Eliscu, Edward, 1902- Eliscu, Edward, 1902-1998 20070207 11 

Habermann, Abraham Meir, 1901- Habermann, Abraham Meir, 1901-1980 20070207 11 

Haksar, P. N. (Parmeshwar Narain), 1913- Haksar, P. N. (Parmeshwar Narain), 1913-1998 20060201 1 1 

Lewis, Joseph H., 1907- Lewis, Joseph H., 1907-2000 20070207 11 

McAvoy, May, 1901- McAvoy, May, 1901-1984 20070207 11 

Pasternak, Joe, 1901- Pasternak, Joe, 1901-1991 20060816 11 

Santorsola, Guido, 1904- Santorsola, Guido, 1904-1994 20060816 11 

Tubb, Ernest, 1914- Tubb, Ernest, 1914-1984 20060816 11 

Elwood, Muriel, 1902- Elwood, Muriel, 1902-1976 20070207 10 

Gonzales, Pancho, 1928- Gonzales, Pancho, 1928-1995 20070207 10 

Johari, Harish, 1934- Johari, Harish, 1934-1999 20060201 10 

Michelin, Bernard, 1918- Michelin, Bernard, 1 9 1 5-2003 20060816 10 

Strock, Herbert L., 1918- Strock, Herbert L., 1918-2005 20060816 10 

Flanders, Ed, 1934- Flanders, Ed, 1934-1995 20070207 9 

Foss, Joe, 1915- Foss, Joe, 1915-2003 20060201 9 

Frohner, Adolf, 1934- Frohner, Adolf, 1 934-2007 20070207 9 

Jordan, Richard, 1938- Jordan, Richard, 1938-1993 20070207 9 

Kamleshwar, 1932- Kamleshwar, 1932-2007 20070207 9 

O'Brien, Virginia, 1921- O'Brien, Virginia, 1921-2001 20060816 9 

Okada, Jo, 1911- Okada, Jo, 1911-1981 20060816 9 

Rybar, Peter, 1913- Rybar, Peter, 1913-2002 20060816 9 

Alice, Duchesse of Glouchester, 1901- Alice, Duchesse of Glouchester, 1901-2004 20060208 8 

Banner, Donald W., 1924- Banner, Donald W., 1924-2006 20060208 8 

Brown, Clarence, 1924- Brown, Clarence, 1924-2005 20060208 8 

Defore, Don, 1917?- Defore, Don, 1913-1993 20070207 8 

Keshet, Yeshuran, 1893- Keshet, Yeshurun, 1893-1977 20070207 8 

Lishner, Leon, 1913- Lishner, Leon, 1913-1995 20070207 8 

Moulton, Augustus Freedom, 1848- Moulton, Augustus Freedom, 1848-1933 20060201 8 

O'Brien, George, 1927- O'Brien, George, 1927-2005 20060816 8 

Philips, Frits, 1905- Philips, Frits, 1905-2005 20060816 8 

Elkoshi, Gedaliah, 1910- Elkoshi, Gedaliah, 1910-1988 20070207 7 

Galai, Binyamin, 1921- Galai, Binyamin, 1921-1995 20070207 7 

McManus, Frederick R. (Frederick Richard), 1923- McManus, Frederick R. (Frederick Richard), 1923-2005 20060816 7 

Pandey, Sangam Lai, 1929- Pandey, Sangam Lai, 1928-2002 20060201 7 

Previn, Charles, 1888- Previn, Charles, 1888-1973 20060816 7 
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Cancelled Name 


New Name 


OCLC List 
Date 


Catalogs Using 
neaaing 


Rousselot, Louis M. (Louis Marcel), 1902— 


Kousseiot, couis ivi. (Louis Marcel), iyuz— iy/4 


7AA£A8 1 A 

ZUUOUo 10 


1 


CinrtVi l\/Ir\l-i"n 100^ 

oingn, ivionan, iyuj— 


QinrrVi A/lnlian 1 QA^ 1 078 

oingn, ivionan, iyuj— iy/o 


ZUUOUO 10 


I 


rielou, cnaries, lyiz— 


rielou, cnaries, lyiz— zuui 


7AA7A7A7 

ZUU /uzu / 





tj,,„ ion 
riunter, Joe, iyz/— 


n,„.f al . t„„ 1 nn 7AA7 

riunter, Joe, iyz/— zuu/ 


7AA7A7A7 

ZUU /uzu / 





Montana Mini iyu4— 


Mnntnnn Oli'm 1 QA/1 1 OQt 

Montana Mim iyi)4— iyyo 


7AA£A8 1 A 

ZUUOUo 10 





Parker, Lester Shepard, b. 1860 


D nl .|,,,. T Cl, Q „nr(l 1 8AA 1 QO^ 

rarKer, cester onepara, loou— lyzD 


7AA£A7A 1 
ZUU0UZU1 





Silsoe, Malcolm Trustram Eve, Baron, 1894— 


oiisoe, Malcolm irustram nve, uaron, ioy4— iy/o 


7AA£A7A 1 
ZUU0UZU1 





TV117I A 1 1 Q/1 8 

rayior, ai, iy4o— 


Trujlr,.. a 1 1 QA 8 1 OQQ 

rayior, iy4o— iyyy 


7AAAA8 1 A 
ZUUOUO 10 


A 



rttiier, rrieuncn, d. io/o 


A A 1 1=1 j- Priori r-Vi 1 878 1 QA0 

rtuier, rneciriLn, io/o— ly^tz 


ZUUOUZUO 




riaiiey, uereK, iyiu— 


rSailey, uereK, lyju— zuud 


7AA£A7AC 

ZUUOUZUo 


c 

J 


Furness, Betty, 1916— 


rurness, oetty, lyio— iyy4 


7 AA7A7 A7 
ZUU /UZU / 


J 


1 0*70 

iviager, Crus, lo/o— 


Mager, ous, lo/o— iyjo 


7 AA7A7 A7 
ZUU /UZU / 


c 

J 


iviccriiiert, uavia ri., iyzo— 


Mccriiiert, uavia r,., lyzo— zuuj 


7AAAA8 1 A 
ZUUOUO 10 


J 


Moore, Garry, 1915— 


Moore, vjarry, iyij— iyyj 


7AAAA8 1 A 
ZUUOUO 10 


J 


fellings, Artnur, lyzi— 


Calling A rtl-.ii.- 1 QO 1 1 QA8 

sellings, Artnur, lyzi— iyoo 


7AAAA8 1 A 
ZUUOUO 10 


J 


Curtis, J acKie, iy4/— 


Curtis, JacKie, iy4/— iyoj 


7 AA£A7 A 1 
ZUU0UZU1 


A 
4 


Kagoshima, Juzo, 1898— 


Kagosmma, juzo, loyo— iysz 


7 AA7A7 A7 
ZUU /UZU / 


4 


Kaaciine, lea, iyuz— 


Kaaciine, lea, iyuz— zuio 


7AA£AS 1 A 
ZUUOUO 10 


A 

4 


Jergens, Adele, 1922— 


Tfn-ftfinf a H^ifi ioi7 onm 
jergens, /\ueie, ivi /— zuuz 


70070707 
ZUU /UZU / 


1 


i^dnsing, KooerL, ivzy— 


T ancirnr P nl-iprt 1070 1 QQ/1 

i^dxisrng, Koueri, ivzy— lyy^t 


70070707 
ZUU /UZU / 


1 
J 


McGee, J. Vernon (John Vernon), 1904— 


McCree, J. Vernon (jonn vernonj, iyu4— iyoo 


7AA7A7A7 
ZUU /UZU / 


1 
J 


Reece, Arley, 1945— 


T> aorta Aria,, 1 Q/l C TAAC 

Keece, Arley, iy4j— zuuj 


7AA£A8 1 A 
ZUUOUO 10 


1 
J 


oaKazaKi, bnizuKa, loo/— 


Colro-rnH Ct,;,, .Ur, 1 QQ-7 1 0"70 

oaKazaKi, onizuKa, loo/— iy/o 


7AA£A8 1 A 
ZUUOUO 10 


1 
J 


c m ;ti-i CfKai loio 
omitn, ntnei, iyiu— 


Cm.'+U TJ+lial 1Q1 A 1 QQA 

omitn, ntnei, iyiu— iyyo 


7AA£A8 1 A 
ZUUOUO 10 


1 
J 


Gainsborg, Lolita Cabrera, 1895— 


Cramsuorg, coiita caurera, loyj— lyoi 


7 AA7A7 A7 
ZUU /UZU / 


z 


Gos, Francois, b. 1880 


Cros, rrancois, loou— iy/j 


7 AA£A7 A 1 
ZUU0UZU1 


z 


carpenter, cnnton a. iyzi— 


carpenter, cnnion a. iyzi— zuuj 


70A£07A8 
ZUUOUZUo 




Carrasquilla L., Juan de Dio, b. 1833 


r^QT-rQcnuillQ T Tunn At* Pti ^ 1811 l OAS 

Cdixdso^Liiiid i^., juan ue uio, iojj— ivuo 


700^0708 
ZUUOUZUO 




rinnioln Tr\l-i« c 1 qa*^ 

uanieis, jonn lyuo— 


T^nniolc TnVin C 1 OAA 1 OOA 

uanieis, jonn lyuo— iyyo 


700707A7 
ZUU /UZU / 




Fisher, Doris, 1915— 


T7i fli ar T\rtrin 1 O 1 C 7 AA1 

risner, uoris, iyij— zuuj 


7 AA7A7 A7 
ZUU /UZU / 




Toman (~" i^r>'t 1 1 Q 1 7 

James, Cecil, lyu— 


j nmac , f^r.A 1011 1 OQQ 

James, Cecil, lyu— iyyy 


7 AA7A7 A7 
ZUU /UZU / 




ceignton, cee, lyuo— 


T aJ/-rVit^i-i T nn 1 QAA 1 OOA 

ceignton, cee, lyuo— iyyo 


7AA7A7A7 
ZUU /UZU / 


1 


\Ar\iAy D i r-V\ o rA 10/1/1 
IVIOLK, JvlCndrtl, 1 7HH— 


AAnnV T? inViQ-rH 1 0AA 7 AAA 
IVIOLK, KlLnaru, 1V44~ ZUUO 


700^08 1 A 
ZUUOUO io 




Roger, Roger, 1911— 


D Ariar TJ n(Tor 1011 100^ 

Koger, Koger, iyii— iyyj 


700^08 1 A 
ZUUOUO 10 




Sampson, Alistair, 1929— 


Qomr\f nn Alictoit- 107Q 7AAA 

Sampson, Alistair, iyzy— zuuo 


700^08 1 A 
ZUUOUO 10 




Cii11ii>n<-> XT^»i1 \7 fXTa! 1 \f:„A™A l (IK 

sullivan, JNeil V. (JNeil Vincent), iyi3— 


C„|i;,, nv , "Mn.'l \F /XT nil "\7iimnnA 1 f) 1 ? 1AAC 

Sullivan, JNeil V. (JNeil Vincent), iyij— ZUU5 


7nA£no 1 a 
ZUUOUo 10 




Anderson, Sigurd, 1904— 


A n^amnn C I mird 1 OA/1 1 OQA 

Anderson, Mgura, iyu4— iyyu 


7AA£A7A8 

ZUUOUZUo 


ft 

u 


Riit^ii Trio ion 
oiiicn, ins, iyiz 


Rlit^Vi Trie 1017 1001 

rsutcn, ins, iyiz— iyyj 


700A07A1 
ZUUOUZU 1 


A 
U 


cnaoioz, rntz, D. io4i 


r~U nKl T7ritT 18/11 1 QA^ 

cnauioz, rntz, io4i— iyuj 


7AA£A7AC 

ZUUOUZUo 


ft 

u 


Devine, Bing, 1916— 


n,, ,: n „ T3inn 1 Ql i 7AA7 

uevine, Ding, lyio— zuu/ 


7AA7A7A7 

ZUU /uzu / 


U 


tvans, w. K. (William Kees), iyiu— 


T7,7-inc WT T? Ali?illlnm D Qa[ ,^ 1Q1A 1 OQ 1 

nvans, w. K. ^William Kees), iyiu— iyyi 


7AA7A7A7 

ZUU /uzu / 


ft 
U 


Fujimoto, Yoshimichi, 1919— 


TTniitnMA VAc Viiminli,' 1Q1Q 1 QQ7 

rujimoio, losnimicni, iyiy— iyyz 


700707A7 
ZUU /UZU / 


A 
U 


Javierre Ortas, A. M. (Antonio Maria), 1921— 


Javierre Urtas, A. M. (Antonio Maria), lyzi— ZUU/ 


TAA"7ATA1 

ZUU /uzu/ 


U 


Johnson, Ernie, 1943— 


Johnson, Ernie, 1943-2005 


20070207 





Kavanaugh, Ken, 1916- 


Kavanaugh, Ken, 1916-2007 


20070207 





Nelson, Paul, 1936- 


Nelson, Paul, 1936-2006 


20060816 





Puzo, Mario, 1920- 


Puzo, Mario, 1920-1999 


20060816 





Szabo, Sandor, 1915- 


Szabo, Sandor, 1915-1997 


20060816 





Tichenor, Jerome, 1911- 


Tichenor, Jerome, 1911-2006 


20060816 





Udam, Haljand, 1936- 


Udam, Haljand, 1936-2005 


20060816 





Vidal, Pietro,b. 1867 


Vidal, Pietro, 1867-1938 


20060201 
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Notes on Operations 

Supporting Name Authority 
Control in XML Metadata: 
A Practical Approach at the 
University of Tennessee 

By Marielle Veve 



While many different endeavors to support name authority control in Extensible 
Markup Language (XML) metadata have been explored, none have been accepted 
as a best practice. For this reason, libraries continue to experiment with the sche- 
ma, tool, or process that best suits their local authority control needs in XML. This 
paper discusses current endeavors to support name authority control in XML for 
digitized collections and demonstrates an innovative manual solution developed 
and implemented by the University of Tennessee Libraries to achieve this goal. 
Even though this method for authority control in XML metadata still relies on 
manual efforts, it effectively reduces time and research work by efficiently setting 
priorities, identifying critical descriptive areas in the digital transcriptions, and 
identifying the most appropriate biographical resources to consult. The effective- 
ness of this approach in improving the rest of the metadata production workflow 
is evaluated and presented. 

Soon after starting digitization projects, many libraries and other institutions 
often find that keeping track of name access points in the Extensible Markup 
Language (XML) is a huge challenge, regardless of the XML schema used. This 
is particularly the case in many types of digitized objects such as manuscripts, 
music, and other types of special collections where the number of personal 
names is exponentially more than the number of items digitized; the names are 
dispersed all over the digitized transcriptions; and information about these names 
is ambiguous, vague, and incomplete. However, no matter how difficult keeping 
track of name access points in digitized materials is, it is necessary in order to keep 
digitized objects retrievable. Access points not only help in the retrieval process of 
documents, but also help keep materials by the same creators or about the same 
subjects together. 

To keep a successful track of name access points in XML documents, 
libraries have been experimenting with many different endeavors to find an 
effective way to achieve this goal. So far the efforts created to support name 
authority control in XML metadata consist of (1) using XML schemas to encode 
authority data; (2) endeavors for shared, cooperative, national, and interna- 
tional XML name databases; (3) manual and automated conversion tools from 
Machine-Readable Cataloging (MARC) to XML; and (4) automated generation 
of authority control through especially designed systems. The problem with most 
of these endeavors is that they only address the issue of how to encode name 
access points utilizing XML authority schema; they do not address the issue of 
how to extract or harvest these names directly from the XML records and trans- 
form them into useful access points. The few endeavors that have tried, such 
as the systems for automated generation of authority control, have only been 
successful in extracting names from XML records but not in turning them into 
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reliable access points. This is because 
their name matching processes fail. 
For this reason these endeavors will 
always depend on human intervention 
to work properly. In addition to not 
being completely reliable, many of 
these endeavors are costly and labor 
intensive, not to mention that most 
of them only display newly created 
access points locally. 

The method introduced in this 
paper to support name authority 
control in XML metadata addresses 
the issue of extracting or harvesting 
names directly from XML documents 
manually and turning them into use- 
ful access points. In contrast to the 
previously mentioned methods, this 
method is effective, relatively sim- 
ple, cost efficient, and has the ability 
to display the new access points at 
the national level by still using the 
richest authority file available — the 
Library of Congress's (LC) author- 
ity file (LCAF, http://authorities.loc 
•gov). This method consists of a simple 
manual approach to extract and create 
name access points that effectively 
reduces time and research efforts by 
efficiently setting priorities, identify- 
ing critical descriptive areas in the 
digital transcriptions, and identify- 
ing the most appropriate biographical 
resources to consult. When using this 
method, libraries will not have to go 
through the work of encoding author- 
ity data into XML schemas or translat- 
ing authority data from one schema 
to another. Neither will they have to 
worry about hiring a programmer to 
build an XML name repository to store 
these records nor to create "shareable" 
XML metadata in order to make local 
authority records interoperable within 
national and international cooperative 
XML authority databases. Finally, this 
method is a practical alternative for 
those libraries and institutions that do 
not plan to build an automated tool to 
extract names directly from the XML 
records, which so far has not proven 
to be a reliable alternative. 



The University of Tennessee 
Libraries' Name Authority 
Challenge 

At the beginning of 2007, the University 
of Tennessee Libraries (UTL) trans- 
ferred the creation of descriptive meta- 
data for digitized manuscripts from 
the Digital Library Center (DLC) to 
the Technical Services Department. 
After archives were scanned and digi- 
tized in the DLC, digital surrogates of 
the manuscripts were created using 
the Text Encoding Initiative (TEI) 
schema. TEI is a markup language 
for representing structural and con- 
ceptual features of texts. It is used 
primarily for the encoding of docu- 
ments in the humanities and social 
sciences and, in particular, in the rep- 
resentation of primary source materi- 
als for research and analysis. Files 
in TEI were sent to the cataloging 
department to be transformed into 
rich descriptive metadata using the 
Metadata Object Description Schema 
(MODS), UTLs selected schema for 
digitized manuscripts. 1 

As a requirement of using MODS, 
catalogers have to use controlled 
vocabularies to assign access points to 
the records. Soon after receiving their 
first batch of TEI encoded records, 
catalogers encountered serious diffi- 
culties in assigning personal names to 
the access points of MODS records. 
The following were the main prob- 
lems: 

• Difficulty in finding names 
in TEI records in the LCAF. 

This problem occurred because 
either the record did not exist 
in the LCAF or because names 
in the TEI records could not 
be matched with names in the 
LCAF because of the lack of 
sufficient biographical informa- 
tion in the TEI records to iden- 
tify individuals. For example, 
proving that the individual in 
the TEI record was the same 
one listed in the LCAF was 



difficult because no data other 
than name were given. 

• Inconsistency in the estab- 
lishment of names not found 
in the LCAF. When names 
were not found in the LCAF, 
different catalogers assigned 
different headings for the same 
person, depending on the form 
of the name given in the manu- 
script. Entering the same indi- 
vidual's name in many different 
ways can create a serious prob- 
lem for future discovery and 
access. 

• Difficulty in differentiat- 
ing individuals with similar 
names within the same collec- 
tion. Many people whose names 
appeared in the manuscripts in 
the UTL collection shared the 
same or similar names with rela- 
tives mentioned in the collec- 
tions. Distinguishing between 
two or more individuals with 
similar names became difficult 
because little, if any, biographi- 
cal data were provided in the 
manuscripts. To make matters 
worse, individuals sometimes 
were called only by nicknames 
or had very commonly used 
names, which were difficult to 
differentiate from other similar 
headings. These factors created 
confusion for catalogers and 
made the process of differentia- 
tion almost impossible. 

• Uncertainty about how to 
handle misspelled names and 
other typographical errors 
in the TEI transcriptions. 
Sometimes errors were made 
in transcribing names from 
the digitized image to the TEI 
files. Catalogers did not know 
whether to go back and fix the 
misspelling by editing the TEI 
record or to create an access 
point using the form found in 
the manuscript, even if it were 
a misspelled form. Different 
decisions made by various 
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catalogers brought more incon- 
sistency to the access points. 

The problems in assigning per- 
sonal headings to the access points of 
MODS records demanded an effective 
method to handle name authority con- 
trol in the UTL's digitized collections. 

Literature Review 

Libraries and other institutions seek- 
ing to support name authority control 
in XML metadata have tried various 
approaches. Some have proven more 
successful than others, but none has 
consistently been implemented for 
XML documents. Some commonly 
mentioned approaches in the library 
literature include Metadata Authority 
Description Schema (MADS), 
MARC Extensible Markup Language 
(MARCXML), Encoded Archival 
Context (EAC), OCLC Linked 
Authority File (OCLC LAF), and the 
Automated Name Authority Control 
(ANAC). Most of these authority ini- 
tiatives for non-MARC metadata are 
designed to handle authority control 
at the local level; only a few try to 
do so at the national level. Some are 
XML schemas for authority elements 
created for use in conjunction with 
particular XML bibliographic sche- 
mas. Others are conversion tools that 
convert MARC into XML records. 
Some authority initiatives claim to 
be automated, but usually these are 
really semiautomated approaches 
that apply a mixture of manual and 
automated approaches to generate 
authority control. 

XML Schemas for Authority 
Elements in Non-MARC Metadata 

In the early 2000s, the LC created 
MARCXML, an XML schema that 
can be used for authority purposes 
and is based on and very similar to 
MARC 21. It was first presented by the 
Information Technology Section at the 



IFLA conference in Glasgow in 2002. 
In a recent report of that meeting, 
McCallum states that "a key character- 
istic of MARCXML is that it produces 
an exact equivalent of the MARC 21 
record so that roundtrip conversion to 
and from it is lossless. This schema has 
been widely used and is the basis for 
the international standard for an XML 
version of the MARC structure that 
Danish standards have proposed." 2 In 
summary, she concludes that MARC/ 
XML "provides a basis for evolution 
while maintaining standardization." 3 

Later in 2005, the LC developed 
MADS, another schema for authority 
elements, but this one was created to 
be used in conjunction with MODS, 
a particular bibliographic schema. As 
in the case of MARCXML, MADS 
also has a strong relationship with the 
MARC 21 authority format. Guenther 
describes one advantage of MADS: 
"Because MADS is derived from the 
MARC 21 Authority format, which has 
been used for more than 30 years, its 
underlying model is well-established 
[and] a MODS description could link 
to a MADS description to eliminate 
redundant information." 4 She also 
mentions disadvantages: "Since MADS 
has not yet been widely implemented, 
it could still be considered experimen- 
tal, and wider experience using it may 
result in refinements to the schema." 5 

EAC (www.library.yale.edu/eac) 
is another schema for authority ele- 
ments created to be used in con- 
junction with a bibliographic schema, 
Encoded Archival Description (EAD). 
EAC started as an original effort 
from a group of archivists who met 
in Toronto in March 2001 to create a 
model for name authority control in 
archival materials. The initiative, still 
in the beta phase, is currently managed 
by an international group of archi- 
vists and Yale University. Thurman 
explains that EAC allows "archivists to 
encode information [in XML] about 
the creators and context of creation 
of archival materials, and to make that 
information available to users as an 



independent resource separate from 
individual finding aids." 6 He notes that 
EAC "development is not yet com- 
plete, and it has so far been imple- 
mented only experimentally."' In the 
effort to create an XML encoding stan- 
dard for archival authority control, Pitti 
concludes that "there are many dif- 
ficult intellectual, technical, cultural, 
linguistic, and political challenges to be 
addressed in order for the effort to be 
successful. While all of the challenges 
are significant, the political challenges 
stand out as particularly difficult." 8 

The MARC Extensible Markup 
Language Document Type Definition 
(MARCXML DTD), which is not the 
same as the MARCXML schema, is 
an older schema format for XML 
created by the LC. It started in the 
mid-1990s as an SGML DTD that 
supported the conversion of data 
from MARC Authority to SGML (and 
back) without loss of data. In the early 
2000s, as technology developed and 
changed, the MARC SGML DTD 
became converted to MARCXML 
DTD. McCallum states in her report 
that this method "yielded very large 
DTDs since [XML DTD] is naturally 
verbose, and the tagging approach 
mandated a DTD element specifi- 
cation for every MARC subfield or 
coded character position." 9 An entry 
in Wikipedia summarizes the prob- 
lems with DTD, noting that it is 
limited because "it has no support 
for newer features of XML, most 
importantly namespaces; uses a cus- 
tom non-XML syntax, inherited from 
SGML, to describe the schema; and 
lacks expressiveness [because] certain 
formal aspects of an XML document 
cannot be captured in a DTD." 10 
Nonetheless, even through its limita- 
tions, MARCXML DTD is still used 
and is kept available in the MARC 21 
website. The reason some keep using 
it is that "several users have stated that 
they find it appropriate for certain 
applications, especially those needing 
extensive validation of records." 11 

Libraries that decide to use any of 
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these schemas to encode their XML 
authority data first will have to decide 
which names they want to extract from 
the XML records to be used as access 
points. This process has to be done 
manually because XML authority sche- 
mas do not extract information directly 
from XML records, but only encode 
it. The chosen names are then turned 
into access points, for which research 
is required. Next, the information is 
encoded into the desired XML author- 
ity schema. After this is done, a local 
XML name repository will need to be 
built and sustained to store and retrieve 
these authority records in XML. The 
problem with relying on XML name 
repositories for authority control is that 
catalogers frequently do not have the 
technological background to build or 
sustain the repositories. For these tasks 
catalogers often will have to rely on 
the library's programmers, who have 
competing responsibilities such as tech- 
nical support or database and catalog 
maintenance. Hiring a programmer to 
work exclusively with the technical ser- 
vices department may be an option, but 
can be very expensive. For these rea- 
sons, the use of schemas for metadata 
authority control may not be the best 
solution for some libraries. 

Conversion Tools from MARC to 
XML Authority Schemas 

Another option for metadata author- 
ity control involves taking names that 
appear in the LCAF (in MARC format) 
and converting them into XML sche- 
mas using conversion tools between 
MARC and XML. Some of these tools 
involve automation, while others do 
not. An example of an automated 
conversion tool between MARC and 
XML is the MARC Tool Kit. This tool 

provides converters for trans- 
forming data from MARC 21 
to MARC-XML and back, 
including character set con- 
version to and from Unicode. 
These converters can be 



downloaded from the MARC 
website and used by others 
in their own systems where 
they can also shape them to 
their own data and needs. 
[This] conversion software 
was developed by Bas Peters 
in the Netherlands and made 
available by him as open 
source software. It is in part 
adapted from an extensive set 
of programs for manipulating 
MARC 21 data. 12 

The LC sees these transforma- 
tions provided from the MARC 21 
maintenance agency as "being valuable 
to the community to help maintain the 
savings and interoperability built up 
through use of a common format." 13 

Maps and crosswalks between 
MARC and XML are other types of con- 
version tools used to translate author- 
ity data from one schema to another. 
These tools use manual approaches 
and, for this reason, require more 
effort on the part of the cataloger. The 
number of this type of conversion tool 
parallels the number of XML sche- 
mas. Some include conversions from 
MARC to Dublin Core and Dublin 
Core to MARC, others from MARC 
to MODS and vice versa. Almost all 
XML schemas have a crosswalk to 
convert their schemas into MARC 
metadata or from MARC to XML. 
An assessment published in Online 
Libraries and Microcomputers reveals 
some of the common challenges faced 
when using crosswalks and maps. 14 
This analysis reports that 

there is often not a one 
for one mapping between 
fields in different metadata 
schemes. This means that 
many fields may need to be 
mapped into fewer fields (or 
vice versa). There can be a 
loss of granularity in metadata 
descriptions that may result 
in poorer searching. Many 
specific metadata schemes 



are targeted to a specific 
subject or type of material. 
When converting to another 
scheme there may be a loss of 
specificity and granularity. In 
metadata mapping one may 
want to parse through free 
text data to extract relevant 
data to extract for a more 
detailed scheme. This is dif- 
ficult, time consuming and 
fraught with error because of 
variations in actual content. 
. . . How does one handle 
subfields and indicators (e.g., 
MARC) when mapping to 
systems that do not support 
the same detail? How should 
subfields from more complex 
metadata schemes be delim- 
ited in less complex meta- 
data schemes? How does one 
map and handle local con- 
trol numbers? Without the 
transferring of local control 
numbers there may be later 
problems in a shared data- 
base for updates, deletes and 
overlapping records. 15 

The idea of converting MARC 
authority records into records that use 
the local XML schema sounds appeal- 
ing, but this method creates double 
work for the library. Converting author- 
ity records from MARC to another 
metadata schema requires translation 
of records plus the construction of 
an XML name repository to support 
the records. Many of the manuscript 
names do not exist in the LCAF, so 
locally established headings will have 
to be created for these names follow- 
ing the construction format of head- 
ings in the LCAF. Following the same 
construction format keeps consistency 
between locally created headings and 
those exported from the LCAF so that 
the headings look the same and index 
the same way. 

If many headings have to be local- 
ly established in XML schema fol- 
lowing the rigorous LCAF standards, 
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then libraries may find establishing the 
headings directly in the LCAF more 
worthwhile because other libraries can 
benefit from this authority work. This 
approach can also save the time nec- 
essary to convert names to another 
schema and to build a database to 
manage them. For these reasons, rely- 
ing on conversion of authority records 
from MARC to XML may not always 
the best approach to support name 
authority control in XML metadata. 

Endeavors for Cooperative 
Searchable XML Name Databases 

Since the early 2000s, libraries and 
other institutions have attempted to 
create a national searchable XML 
name repository. Shared XML name 
repositories try to harvest name 
authority data from different sources 
distributed throughout the country 
and make it interoperable between 
different institutions. One example of 
such an attempt is the OCLC Linked 
Authority File Project (http://alcme 
.oclc.org/laf), an endeavor between 
the Open Archives Initiative and the 
OCLC. The Linked Authority File 
(LAF) was developed in 2002, hosts a 
shared server containing LC author- 
ity records and potentially author- 
ity records supplied by others, and 
is intended to provide Web-based 
accessed to interactive and automated 
authority records. This national name 
repository periodically uploads names 
from the LCAF and presents them 
in both MARC and MARCXML for- 
mats. 

Even though the LAF's original 
intention was to harvest names from 
different sources besides the LCAF, 
this has not being done yet. When 
asked if the LAF plans to harvest 
authority data from other sources 
besides LCAF, an OCLC Research 
representative replied that "no further 
development of the system itself is 
planned." 16 No explanation was given 
on why the LAF only harvests author- 
ity data from the LCAF and not from 



other sources, but this may be due 
to the difficulty of making authority 
metadata interoperable between dif- 
ferent institutions, a common problem 
faced by cooperative, inter-institution- 
al databases. Given to the lack of pro- 
motion, this initiative is fairly unknown 
and, consequently, has not been widely 
implemented. 

The Linking and Exploring 
Authority File s project ( http ://xml . cover 
pages.org/leaf.html) was an attempt to 
create a cooperative searchable XML 
database for authority names for the 
European community. It was created 
with the purpose of being accessed 
by anyone, regardless of affiliation, 
who might be interested in name 
authority files from European manu- 
scripts. This three-year project (2001- 
4) was cofunded by the Information 
Society Technologies Program of the 
Fifth Framework of the European 
Commission. 

Linking and Exploring Authority 
Files (LEAF) sought to develop a 
system model that uploaded name 
authorities — distributed through local 
servers of participating European 
organizations — to the central LEAF 
system. Authorities then were con- 
verted and stored into EAC sche- 
ma, with authorities that belonged 
to the same entity being automati- 
cally linked. To have a network where 
those linked records could be applied, 
LEAF was integrated into a search 
engine called Manuscripts and Letters 
via Integrated Networks in Europe 
(MALVINE). MALVINE "is a search 
engine that harvests databases which 
provide information about letters writ- 
ten by famous persons that are kept in 
different European institutions." 17 

After being integrated into the 
MALVINE search engine, the linking 
process of LEAF proved not to be reli- 
able. Kaiser and colleagues stated that 

it is inevitable that in some 
instances the linking pro- 
cess will produce incorrect 
results. Records describing 



two different persons might 
be automatically linked 
because they do not contain 
enough discriminating infor- 
mation. On the other hand, 
two records representing 
the same person might not 
be linked because they do 
not share an identical name 
form. Recollecting the main 
purpose of library authority 
records — the disambiguation 
of persons described — it may 
be argued that those records 
leading to wrong links are not 
sufficiently rich in content 
to serve their original pur- 
pose. 18 

The project ran as a funded test 
for thirty-six months, ending in May 
2004. Thereafter it was left as an inte- 
grated part of the MALVINE search 
engine, where it is still used because it 
is seen as "highly relevant [content] to 
the cultural heritage of Europe." 19 

In theory, using a shareable XML 
name database sounds like a great plan 
for libraries and institutions that already 
have built a local XML name reposi- 
tory because records can be uploaded 
by one entity and shared between 
different institutions. In reality, expe- 
rience has shown that this approach 
does not work because for metadata to 
be successfully harvested by a national 
cooperative repository, all locally cre- 
ated authority metadata needs to be 
"shareable." Shareable metadata are 
metadata that need to follow a set of 
standards to be interoperable between 
different institutions. The standards 
needed to create shareable metadata 
have not yet been established because 
of the lack of cooperation between 
different institutions. Pitti states, "As 
economically and professionally desir- 
able as cooperative, shared authority 
control, and biographical, historical 
description is, successful realization 
will require standards and systems 
that are collaboratively developed, 
administered, and maintained. These 
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standards and systems will have to 
serve both individual and shared inter- 
ests. Successfully balancing competing 
interests will require a great deal of 
patience, goodwill, and intelligence." 20 

Automated Endeavors to Support 
Name Authority Control in XML 

Other projects have sought to solve 
the problem of addressing author- 
ity control in XML records on a local 
scale by implementing automated pro- 
cesses to extract and detect possible 
name access points in XML records. 
For example, in 2003, the Digital 
Knowledge Center (DKC) at John 
Hopkins University explored the appli- 
cation of automating metadata gen- 
eration for name authority control. 21 
To achieve this purpose, the DKC 
created a tool called the Automated 
Name Authority Control (ANAC). 
This automated metadata generator 
applies an established algorithm to 
identify LC-authorized names for 
each name in descriptive metadata 
records. Patton and colleagues stated 
that "The main reason for under- 
taking ANAC was to develop a tool 
that would reduce the costs associ- 
ated with introducing name authority 
control to metadata [because] relying 
exclusively on human catalogers would 
be substantially more expensive and 
time consuming." 22 After evaluating 
the tool, the authors determined that 
the automated system was not suf- 
ficiently reliable in many cases. They 
added, "Even though ANAC could be 
a valuable complement [to authority 
control], it was never anticipated that 
it would entirely replace the human 
effort." 23 The authors concluded that 
the most effective and cost-efficient 
workflow would couple ANAC with 
human oversight. 

Another attempt for automated 
generation of name authority control 
in digitized collections was suggested 
by French, Powell, and Schulman. 24 
They introduced the concept of 
approximate word matching similar to 



the approximate string matching tech- 
niques traditionally used in detect- 
ing variant names in databases. This 
approach detects variable forms of 
strings in names through clustering 
algorithms and then groups the strings 
together under a standard form. The 
authors observed that, even though 
this automated clustering approach can 
reduce human effort by half, a certain 
amount of human effort will always be 
required to verify the output, thus this 
approach is semiautomatic. 

Although systems created for 
the "automated" generation of name 
authority control claim to be automat- 
ed, they are not completely so. They 
are really semi-automated approach- 
es because they will always rely on 
human intervention for the process to 
work properly. Because of the need for 
human intervention and the high cost 
of creating such an endeavor, systems 
for the so-called automated genera- 
tion of name authority control may not 
always be the best approach to support 
XML name authority control in many 
libraries. 

Designing a Practical 
Approach at UTL 

After reviewing the library literature 
and analyzing the advantages and dis- 
advantages of the different approaches 
to support name authority control in 
XML metadata, UTL decided that 
none of these approaches was appro- 
priate for the local situation. Because 
of time and funding constraints and 
lack of technological support, UTL 
decided to design a different approach 
that would be customized for UTL. 
Several points needed consideration. 
First, the new authority control meth- 
od had to be completely sustainable 
by the UTL catalogers. Sustainable 
in this context meant the method had 
to be cost effective and use a level of 
technology with which the catalog- 
ers were comfortable. Because the 
cataloging department could not hire 



a local programmer, they ruled out 
building a local XML name repository 
and decided to capitalize on existing 
staff knowledge instead. Second, the 
authority control had to be achieved 
within a reasonable amount of time. 
Because this work is time consuming, 
priorities for which names to establish 
and which ones to leave out had to be 
set from the beginning. These priori- 
ties will be referred to from now on as 
"establishment criteria." The reason 
for using establishment criteria is that 
searching, verifying, and establishing 
each name found in the TEI records 
would be impossible because they 
number in the thousands. The criteria 
would determine the cases in which 
names would be searched and estab- 
lished in the LCAF Third, the new 
authority method had to support a 
level of quality if UTL wanted to keep 
materials by the same creators togeth- 
er. Given these considerations, UTL 
decided to use a manual approach that 
keeps taking advantage of the larg- 
est name authority file available — the 
LCAF. 

The method UTL implemented 
integrated all the previous points. The 
process is described in detail in the 
following section. Briefly, authority 
control is performed as soon as the 
DLC sends the TEI transcriptions to 
the cataloging department. Authority 
control, then, is performed before 
records are cataloged, and only by 
one person to avoid future inconsis- 
tencies in establishing names. The 
person chosen to perform this task is 
one of the catalogers, who had pre- 
vious experience creating authority 
records through the Name Authority 
Cooperative Project (NACO). After 
authority work is finished, headings are 
stored in a Microsoft Excel shareable 
spreadsheet. As soon as the spread- 
sheet is ready, catalogers are notified 
and TEI records are sent to them. The 
catalogers then have the necessary 
resources to catalog and create rich, 
descriptive MODS records with the 
least amount of effort. 
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The detailed process followed at 
UTL consists of the following steps: 
The librarian charged with authority 
control receives a batch of one hun- 
dred to three hundred TEI transcrip- 
tions with their digital images from the 
DLC. The files usually originate from 
many different collections in the uni- 
versity archives. The authority control 
librarian devotes approximately two 
weeks (full time) to authority work for 
this batch of records. After receiving 
the TEI files, the authority librarian 
opens and browses the files using XML 
Pad. First, she groups the files by col- 
lection name. She identifies the col- 
lection to which a TEI file belongs by 
looking at the tag < collection > under 
the <sourceDesc> in the TEI file or 
by looking at the second portion of 
the TEI identification number. In the 
example 0012_000060_000337_0000, 
"60" identifies the collection to which 
the TEI belongs. This is illustrated 
in figure 1. (The zeros serve as fill 
characters and are omitted by the 
authority librarian when noting these 
data in working files.) By grouping 
TEI files by collection, catalogers will 
start working with all TEI files in one 
collection before moving to the next 
one. The logic behind this approach 
is to become familiar with the finding 
aid of a single collection by reading it 
once instead of having to read it many 
times. Another reason is that names in 
TEI files within one collection usually 
relate to each other, making keeping 
track of family and other types of rela- 
tionships easier. 

Within each TEI file, the author- 
ity librarian browses the following sec- 
tions to search for names: 

• Title section, to retrieve 
the names of senders and 
recipients. 

• Body section, to find names that 
"pop out" as important access 
points. If summary sections are 
provided, the librarian should 
browse through them as well. 

• Signature section, to determine 



the preferred form of heading 
for the sender. 

Names found in these sections 
become crucial because they will form 
access points for the MODS record, 
the equivalent of the fields lxx (main 
entry fields), 6xx (subject access), and 
7xx (added and linking entry fields) in 
MABC records. The authority librar- 
ian should pay atten- 
tion to variant forms 
of headings as well. 

The author- 
ity librarian makes 
a list of the names 
found, as well as of 
their variant forms, 
along with the 
record number of 
the TEI where the 
names were found. 
By recording this 
number, the librar- 
ian can later retrieve 
the exact location of 
these names in case 



more information is needed. In going 
through the rest of the TEI files, the 
authority librarian may encounter the 
same names, as well as other new 
names or variant forms of names, and 
keeps a list. This process is illustrated 
in figure 2. 

After browsing the TEI files in 
one collection, the authority librarian 
looks at the names gathered so far and 




<TEI id="0012_000060_000337_0000"> 
<titleStmt> 

<sourceDesc> 

<"t((//t't7['y« ">Lina Smith Family Pa|i&K$^ 
<"kolhction"> 



Collection's Name 



Collection's Number 



Figure 1. Sections to Browse in the TEI File to Determine the 
Collection to which the TEI File Belongs 



Collection "Lina Smith Family Papers" 1845-1897 



TEI 

<TEI id="001 2_000060^)00337_0000": 

<tUUStmt> 

Letter, Lina Smith in Memphis, TN, I 
Hannah Smith, in Charlotte, N.C. 
<JtithStmt> 
<M«i/r('/J('\c>3 pawl's 

<body> 

John 'I', and 1'rauk Lemur harked nut 
from the trade. The reason they gave ... 



<p>You 



truly, I 



a S.</p> 



1 



-Body, 



..Signature 




nT ei 

<TEI id="0012_OOo9BO_00034Q_0000 "; 

<titkStti 

■ Letter, Hannah Smith in Charlotte, N.C, 
to Lina Smith, in Memphis, TN 
<ftitUStmt> 
<sourct'Ih'sc>2 pages 



<Jsm 



ceDest 



:ceived your letter yesterday. My 
daughter Portia got silk Monday with... 

, <p>Love, H. S.</p> 
<Jbody> 



Names found in TEI records 



2. Hannah Smith -> TEI # 12-60-337 & TEI # 12-60-340 

H. S. -> Variant form TEI #12-60-341 

3. JohnT. -> TEI # 12-60-337 

4. Frank Lenoir TEI # 12-60-337 

5. Portia^ TEI #12-60-340 



Figure 2. Sections to Browse in the TEI File to Get Names and How to Properly Keep a 
Record of Them in a List 
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compares them to see if some names 
have been mentioned more than once 
using the same form or a variant one. 
She also counts the number of times 
each name is mentioned in different 
TEI files. 

The establishment criteria are 
then applied to names in that par- 
ticular collection. These criteria help 
determine which headings will be 
searched, verified, or established, and 
which ones will not. UTL developed 
establishment criteria that worked well 
in most situations. Names mentioned 
in at least three separate TEI files are 
searched in the LCAF and established 
if not found. The same process applies 
to names mentioned in the "Title" 
section of the TEI files as senders or 
recipients and to names that have a 
collection with their name (this can 
be checked in the "Collection" sec- 
tion of the TEI file). The one excep- 
tion to the establishment criteria is 
the handling of names for prominent 
historical individuals. Because they 
are likely to appear in the LCAF, 



they are also searched. Figure 3 illus- 
trates application of the establishment 
criteria. 

For names that will be established 
according to the criteria, the authority 
librarian returns to the TEI files in 
which they were found. This is done 
by using the TEI record number that 
was noted on the list. When retrieving 
the TEI records, the librarian browses 
the text around the area where the 
names were found to get as much 
information — stated directly or indi- 
rectly — about the person as possible. 
Examples of useful areas to browse in 
TEI files are the "Title" section, which 
gives the date and place a letter was 
sent, and the "Body" section, which 
may provide information on people's 
roles, relationships, and so on. The 
authority librarian annotates this infor- 
mation, along with the variant forms 
of the name found. The result might 
look like this: 

Jacob Breck, Jab Breck, 
Jacob B.; sender of letter 



from Franklin, TN in 1864 
to Philadelphia, Penn; judge; 
wife Lizzie. 

These brief factual data will provide a 
general idea of who this individual was 
and when he or she lived. The data 
found can be expanded later through 
further research in outside sources. 

After all biographical facts avail- 
able in TEI files have been annotated, 
the authority librarian then consults 
various research tools such as finding 
aids. The University Archives, which 
own the original texts for the TEI files, 
have created finding aids, many of 
which are online. Tennessee state and 
county archives also may contain relat- 
ed finding aids. These tools may pro- 
vide information on the person's time 
period, family, place of residence, and 
more details. This information will be 
used by the authority librarian to place 
this person in context, see with whom 
he or she associated, and differentiate 
the individual from others with similar 
names when searching the LCAF. 

If finding aids do not provide 
enough information about a per- 
son or are not available, the librar- 
ian searches other outside sources 
such as Google Book Search. This 
tool provides the ability to search 
sections within long texts of reliable 
resources that are freely accessible 
online. Other useful and freely acces- 
sible websites for historical biographi- 
cal research include the Political 
Graveyard — A Database of Historic 
Cemeteries (http://politic algr avey ard 
.com), the state finding aids via the 
state library or state historical soci- 
ety websites, Genealogybuff.com, the 
Biographical Directory of the United 
States Congress (http://bioguide 
.congress.gov/biosearch/biosearch 
.asp), and the Civil War Posters 
website (www.geocities.com/Area51/ 
Lair/3680/cw/cw.html). The latter can 
be searched by soldier, regiment, and 
more. In addition, the authority librar- 
ian searches the Tennessee Genealogy 
and History Web (TnGenWeb. 



<Jacob Breck> 




<sourceDesc> 
<body> 

<Jacob Breck>^ 



TEI #3 

<sourceDesc> 
<body> 

<Jacob Breck>p ^ 



1) Names mentioned in 3 or more different TEI records 



TEI 

<TEI id="001 2_000074_0001 56_0000"> 

<tilteSlml> 
Letter, Ronald Davis in Clint, Fl., to Helen 
Krutz, in Charlotte, N.C. 
<JtitkStmt> 
<sourceDeso 3 pages 

<"Collection"> John Krutz Papers M 

<JsoitrceDeso 
<body> 

My dear sister I've been 

<p> Love, Ron Davis<^» 
<Jbody> 



2) Names mentioned in 
"Title" section of TEI records 



3) Names mentioned in "Collection" 
section of TEI records 



Figure 3. Application of Establishment Criteria for Names 
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org) — other states may have similar 
sources. Other fee-for-service geneal- 
ogy databases are also available. When 
searching historical personal names 
online, a useful tip for best results is to 
search using very specific factual data. 
For example, if the only information 
available from a TEI transcription 
about "Lord Cornwallis" indicates 
that he was alive during 1791 and 
wrote from Blount County, these facts 
should be integrated into the search. 

As relationships between indi- 
viduals start becoming clearer, the 
authority librarian should illustrate 
the relationships using visual aids in 
addition to noting the information. 
Visual aids such as genealogical trees, 
arrows, and diagrams can prove use- 
ful to represent relationships between 
individuals. Visual aids are important 
for the authority librarian because 
she may need to consult these aids to 
create names, and they are important 
for the rest of the metadata team, who 
will assign the names as access points 
in the MODS records. Understanding 
relationships among the individuals in 
the digitized transcriptions is crucial 
to create useful access points for the 
records. 

After gathering enough data about 
a particular individual, the librarian 
searches the name in the LCAF. At 
this point, she will have enough bio- 
graphical information to distinguish 
that individual from others with simi- 
lar names in the LCAF. If the heading 
is found in the LCAF, the authority 
librarian copies and pastes the head- 
ing into a local Excel spreadsheet, 
along with its cross-references and 
notes. The spreadsheet serves two 
purposes: It builds a local database of 
the established names found in UTLs 
digitized archives and provides cata- 
logers with a narrower list of estab- 
lished names that appear in the TEI 
files they will catalog, saving them the 
time and effort of searching the LCAF. 
Sometimes, the authority librarian has 
located extra information not already 
mentioned in the LCAF about the 



individual listed. This extra informa- 
tion, such as biographical details or 
other variant forms of names, can be 
added optionally to the LCAF record 
in order to enhance it. This additional 
information can help differentiate this 
person from others with similar names 
in the future. 

If the heading is not found in 
the LCAF, the librarian searches the 
OCLC Connexion Bibliographic File 
for records that used this name in 
any of their access points. To search 
in these areas of the bibliographic 
files, the authority librarian performs 
keyword searches using the index 
labels "au" (author) and "su" (subject). 
Searching in the OCLC Bibliographic 
File is a step required before estab- 
lishing any heading in the LCAF. This 
search often leads to records that 
mention variant forms of this person's 
name as well as extra facts not discov- 
ered previously. 

After searching the OCLC 
Bibliographic File, the authority librar- 
ian establishes headings that were not 
found in the LCAF using the bio- 
graphical information gathered to this 
point. Headings can be established 
locally or nationally, depending on the 
institution's involvement with NACO. 
Libraries that are NACO members 
or part of a NACO Funnel Project 
have the option of making name con- 
tributions nationally. A NACO funnel 
project is a group of libraries who 
together are authorized to contrib- 
ute name authority records to the 
LCAF. On the other hand, libraries 
that are not NACO members will not 
have the option of making national 
name contributions and will have to 
establish them locally. UTL has the 
option of making name contributions 
to the LCAF because it is a mem- 
ber of the Tennessee NACO Funnel 
Project. When establishing a head- 
ing, the authority librarian includes all 
the cross-references and factual data 
found previously in the research that 
may prove useful for the future. After 
establishing a heading in the LCAF, 



the authority librarian copies and 
pastes the heading into the local Excel 
spreadsheet with all the other LCAF 
names already found in OCLC. 

After the chosen headings from 
one particular collection have been 
searched and established, the author- 
ity librarian browses the TEI files 
of the next collection, repeating the 
steps described above until all col- 
lections in the batch of TEI files are 
completed. 

After names that met the estab- 
lishment criteria have been searched 
or established, the lists of names that 
did not meet the establishment cri- 
teria remain. These lists are kept by 
the authority librarian in case any 
of the names need to be established 
in the future. Each list contains the 
TEI record numbers indicating where 
those names were found and can help 
retrieve the records if they are needed 
later. 

After the authority librarian com- 
pletes these steps, the authority work 
is considered completed. The tools 
and resources needed for cataloging 
metadata are then placed in a share- 
able department server. These include 
the digitized files in JPEG, transcrip- 
tion files in TEI, visual tools, and the 
Excel spreadsheet with the authorized 
name headings. The catalogers are 
then prepared to start creating MODS 
descriptive metadata with the least 
amount of inconvenience. 

Implementing this authority con- 
trol process before the rest of the 
metadata production starts solved the 
problems UTL initially faced when 
trying to assign name access points 
to MODS records without authority 
control. This approach to authority 
control solved both the difficulty in 
finding TEI names in the LCAF and 
the inconsistency in establishing names 
if they were not found there. Now 
that the authority librarian provides 
all the necessary authority work, the 
catalogers will not have to worry about 
searching these names in the LCAF or 
establishing them. The catalogers will 



50 Veve 



LRTS 53(1) 



find the established forms plus their 
variants in a local, shared Excel list. 

Placing authority control before 
the rest of the metadata process per- 
mitted the catalogers to focus on the 
rest of the description. It solved the 
difficulty of differentiating individuals 
with very similar names within a collec- 
tion by providing useful biographical 
information. The use of qualifiers and 
other attributes in authority control 
also helped in this purpose. The provi- 
sion of visual aids such as genealogical 
tables helped catalogers throughout 
the process of visualizing family rela- 
tionships and helped to diminish con- 
fusion about similar names. 

The problem of misspelled names 
and other typographical errors that 
occurred when transcribing names 
from the original text to the TEI 
files was also solved with this author- 
ity method. By receiving the TEI 
files with their digitized images as a 
first step, the authority librarian had 
the opportunity to catch transcrip- 
tion mistakes and fix them before the 
catalogers had the chance to discover 
them. 

Assessing the Effectiveness of 
UTL's Approach 

To assess the effectiveness of this 
approach, UTL decided to compare 
the metadata workflow before having 
authority control with the workflow 
after implementing authority control. 
UTL performed an informal assess- 
ment through a questionnaire, asking 
the six catalogers who experienced the 
first workflow without authority con- 
trol to compare particular production 
aspects within both workflows. The 
questionnaire was distributed three 
months after the implementation of 
authority control into the metadata 
workflow and consisted of ten closed 
questions and one open question to 
provide suggestions. The question- 
naire is presented in the appendix to 
this paper. 

In the questionnaire, the six 



catalogers were asked if the speed of 
producing MODS records improved 
after the implementation of pre- 
cataloging authority control. All six 
agreed that the speed of producing 
MODS records was higher after the 
implementation of authority control. 
When asked to estimate the num- 
ber of MODS records produced per 
week before the implementation and 
the number produced per week after 
the implementation, they reported 
a much higher number of MODS 
records produced per week after the 
implementation. The six catalogers 
responded that before the imple- 
mentation, an average of five or less 
records were produced per week; 
after the implementation, five catalog- 
ers reported an average of ten or more 
records produced per week and one 
cataloger reported six to nine records 
per week. 

Catalogers were asked if the pro- 
vision of authority control, before they 
began metadata work, freed them 
to concentrate on other important 
descriptive metadata tasks such as 
assigning subject headings, writing 
summaries, and analyzing the TEI 
record. To this question five of the six 
catalogers responded yes, the provi- 
sion of authority control freed them 
to perform other important metadata 
tasks; one cataloger answered that it 
made no difference. Concerning qual- 
ity of MODS records produced, all 
six agreed that the quality of MODS 
records improved after the implemen- 
tation of authority control. Reasons for 
the quality improvement of MODS 
records were that more controlled 
access points were available than 
before the process changed, and that 
they were more consistent. Five of 
the six catalogers agreed that MODS 
records were more difficult to cre- 
ate before having the new-approach 
authority control. Reasons given to 
explain this difficulty before having the 
new approach were that there were 
inconsistencies in names established, 
distinguishing different persons with 



similar names was more difficult, and 
no visual tools were available to clarify 
relationships between individuals. Of 
the six, only one cataloger reported 
that the difficulty of creating MODS 
records was the same before and 
after the implementation of authority 
control. 

Future Plans 

While UTL's informal assessment 
demonstrated the effectiveness of this 
authority method in improving the 
MODS metadata production work- 
flow, it also showed aspects that need 
improvement and issues that will need 
to be addressed in the future. In the 
suggestions at the end of the assess- 
ment, two catalogers showed concern 
about what will happen to the meta- 
data workflow if the authority librar- 
ian leaves. To solve this, UTL will 
eventually need to expand and del- 
egate authority control tasks to other 
members in the cataloging depart- 
ment so that authority control does 
not depend on one person's contribu- 
tions. Initially, some authority control 
responsibilities, such as research tasks, 
can be delegated to members within 
the cataloging department. Eventually 
this responsibility can expand, with the 
catalogers creating personal authority 
records. They will need training either 
from the local authority cataloger who 
has NACO experience or through 
the closest NACO Funnel Project. 
Both alternatives would require ini- 
tial time investment by the staff and 
institution, but this option could help 
make the workflow run more smoothly 
and to cover for the person perform- 
ing authority work in case he or she 
leaves. 

Another issue identified through 
the questionnaire was the increas- 
ing difficulty of searching names with 
many cross-references in Excel. As the 
number of names with cross-referenc- 
es increases, so does the difficulty in 
handling them effectively by the Excel 
software. Excel was not designed to 
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handle information arranged in the- 
sauri format but primarily to han- 
dle numerical data. For this reason, 
commercial software that is better 
suited to handle cross-references will 
be needed in order to substitute for 
Excel. Thesauri software, which is 
software designed to build and edit 
thesauri headings, can manage cross- 
references very well and is cheaper 
than hiring a programmer to build an 
XML repository. Thesauri software is 
available in standalone packages and 
as database modules, which are inte- 
gral parts of larger systems and need 
to run with them. Examples of popu- 
lar standalone packages are MultiTes, 
Data Harmony, a.k.a. Classification 
Software, STRIDE, and Term Tree 
2000. Examples of database modules 
are STAR and The Ma Thesaurus 
Manager for Oracle. Using thesauri 
software is an economical and attrac- 
tive option to store and manage local 
authority names and one that UTL will 
begin to explore. 

Conclusion 

As evidenced throughout this paper, 
many libraries and institutions are 
looking for ways to turn necessary 
tasks over to machines, but experi- 
ence suggests this is not yet possible 
for name authority control in XML 
metadata. The efforts created so far to 
achieve this goal, besides being costly 
and work intensive, have proved to 
be ineffective and unreliable. Most 
do not address the issue of how to 
extract or harvest names directly from 
the XML records and transform them 
into useful access points, but focus 
on how to encode the access points 
into XML authority schema. The few 
endeavors that have tried to harvest 
names directly from XML records 
have proved not to be completely 
reliable in their processes of match- 
ing and linking names, making them 
dependent on human effort. 

In addition to not addressing how 



to select and extract access points 
from the XML records, most of these 
endeavors require labor-intensive 
encoding of authority data into XML 
schemas and, subsequently, the cre- 
ation of a local XML name repository 
to store and manage these records. 
Building an XML name repository is a 
task that requires a high level of tech- 
nological background most catalogers 
lack. For this reason a programmer 
will have to be hired to build a name 
repository, and this is an expensive 
approach not many libraries can pur- 
sue. Furthermore, creating authority 
data to be stored in a local repository 
will only benefit the local institution, 
causing inconsistencies and duplica- 
tion of efforts between different insti- 
tutions that try to set up access points 
for the same individuals. Initiatives 
that tried to avoid the duplication of 
efforts in name authority control — by 
creating a national XML name reposi- 
tory to share authority data and make 
it interoperable between different 
institutions — have not been successful 
because the XML authority data needs 
to be shareable to be interoperable 
between the national repository and 
the other institutions. To date, this has 
not been successfully achieved. 

In contrast to these approach- 
es, UTLs method to support name 
authority control in XML metadata is 
effective, reliable, and cost effective. It 
addresses the issue of extracting names 
directly from the XML documents and 
turning them into useful access points 
that can be shared nationally through 
the LCAF, thus avoiding duplication of 
efforts and benefiting all libraries who 
may share the same access points. 

UTLs approach is simple and can 
be used by other libraries and institu- 
tions that face similar issues when 
trying to support name authority con- 
trol in their XML metadata. Common 
problems such as inconsistency in the 
establishment of names, difficulty in 
differentiating individuals, and decid- 
ing which names to turn into access 
points can be solved by implementing 



this method before creating any 
descriptive metadata for digitized 
transcriptions. Regardless of the local 
XML schema used, this approach can 
be applied in the same way to different 
collections. 



References 

1. Library of Congress, Metadata Object 
Description Schema (MODS), 
MODS Schemas, www.loc.gov/stand 
ards/mods (accessed June 28, 2008). 

2. Sally H. McCallum, "MARC/XML 
Sampler," International Cataloguing 
i? Bibliographic Control 35, no. 1 
(Jan./Mar. 2006): 4. 

3. Ibid., 6. 

4. Rebecca Guenther, "MADS," 
Computers in Libraries 27, no. 4 (Apr. 
2007): 14. 

5. Ibid. 

6. Alexander C. Thurman, "Metadata 
Standards for Archival Control: An 
Introduction to EAD and EAC," 
Cataloging ir Classification Quarterly 
40, no. 3/4 (2005): 184. 

7. Ibid., 199. 

8. Daniel V. Pitti, "Creator Description: 
Encoded Archival Context," 
Cataloging ir Classification Quarterly 
38, no. 3/4 (2004): 217-18. 

9. McCallum, "MARC/XML Sampler," 
4. 

10. Valid Documents: XML Semantics, 
DTD, en.wikipedia.org/wiki/XML 
(accessed July 5, 2008). 

11. McCallum, "MARC/XML Sampler," 
4. 

12. Ibid., 5. 

13. Ibid. 

14. "Challenges and Issues with Metadata 
Crosswalks," Online Libraries ir 
Microcomputers 20, no. 4 (Apr. 2002): 
1-4. 

15. Ibid. 

16. Jeff Young, "RE: OCLC LAF ques- 
tion," e-mail to author, July 9, 2008. 

17. Jutta Weber, "LEAF: Linking and 
Exploring Authority Files," Cataloging 
ir Classification Quarterly 38, no. 3/4 
(2004): 230. 

18. MaxKaiseretal., "New Ways of Sharing 
and Using Authority Information: 
The LEAF Project," D-Lib Magazine 
9, no. 11 (Nov. 2003), www.dlib.org/ 



52 Veve 



LRTS 53(1) 



dlib/november03/lieder/lllieder.html 
(accessed July 6, 2008). 

19. LEAF Project Synopsis, www.crxnet 
.com/leaf/info.html (accessed July 7, 
2008). 

20. Pitti, "Creator Description: Encoded 
Archival Context," 218. 

21. Mark Pattonetal., "Toward a Metadata 



Generation Framework: A Case 
Study at Johns Hopkins University," 
D-Lib Magazine 10, no. 11 (Nov. 
2004), www.dlib.org/dlib/novem 
ber04/choudhury/llchoudhury.html 
(accessed March. 27, 2008). 

22. Ibid. 

23. Ibid. 



24. James C. French, Allison L. Powell, 
and Eric Schulman, "Using Clustering 
Strategies for Creating Authority 
Files," Journal of the American 
Society for Information Science 51, 
no. 8 (June 2000), www.cs.virginia 
.edu/papers/Using_Clustering.pdf 
(accessed March 27, 2008). 



Appendix. Comparison of Metadata Workflow Before and After 
Implementation of Authority Control 



1) Do you think the speed of producing MODS records 
was higher? 

a) Before the provision of authority control 

b) After the provision of authority control 

c) It was the same before and after 

2) Do you think the quality of MODS records produced 
was better? 

a) Before the provision of authority control 

b) After the provision of authority control 

c) It was the same before and after 

3) If you answered "after" to the previous question, why 
do you think the quality of MODS records was better 
after the implementation of authority control? Choose 
all that apply: 

a) Because there were more controlled access points 

b) Because access points were consistent between 
records 

c) Because records were more accessible to users 

d) None of the above 

4) Do you think the production of MODS records was 
more difficult? 

a) Before the provision of authority control 

b) After the provision of authority control 

c) It was the same before and after 

5) If you answered "before" to the previous question, why 
do you think it was more difficult to produce MODS 
records before the provision of authority control? 
Choose all that apply: 

a) Because there were inconsistencies in names 
established 

b) Because it was harder to distinguish different per 
sons with similar names 

c) Because there were no visual tools available to 
understand relationships between persons 

d) None of the above 

6) Do you think the provision of authority control for 



names in metadata records frees you to concentrate in 
other important tasks such as assigning subject head- 
ings, writing an abstract, or analyzing the TEI records? 

a) Yes 

b) No 

c) It makes no difference 

7) Do you think the provision of authority control for 
names before MODS are produced improves the meta- 
data workflow in general? 

a) Yes 

b) No 

c) It makes no difference 

8) On average how many MODS records did you create 
per week before the implementation of authority con- 
trol into the metadata workflow? 

a) More than 10 

b) 6-9 

c) 5 or less 

9) On average how many MODS records did you create 
per week after the implementation of authority control 
into the metadata workflow? 

a) More than 10 

b) 6-9 

c) 5 or less 

10) In which aspects of authority control would you like to 
see more improvement? Choose all that apply: 

a) Searching names in Excel spreadsheet 

b) Illustration of visual aids 

c) Time for authority control to be ready 

d) Other, please explain: 

e) None 

11) Do you have any additional comments or insights 
regarding authority work for the metadata workflow? 
(For instance, recommendations for workflow, tools 
improvement, adjustments, and so on?) 

Thank you for taking the time to answer the questionnaire! 
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Notes on Operations 

Using Batchloading to Improve 
Access to Electronic and 
Microform Collections 

By Rebecca L. Mugridge and Jeff Edmunds 



Batchloading bibliographic records into the catalog, as a rapid and cost-effective 
means of providing access to electronic and microform collections, has become in 
recent years a significant workflow for many libraries. Thanks to batchloading, 
previously hidden collections, some costing hundreds of thousands of dollars, are 
made visible, and library holdings are more accurately reflected by the online cata- 
log. Subject specialists report significant increases in the use of electronic resources 
and microforms within days (and sometimes only hours) of loading record sets 
into the online catalog. Managing batchloading projects requires collaboration 
across many library units, including collection development, acquisitions, catalog- 
ing, systems, and public services. The authors believe that their experiences will 
be instructive to other libraries and that Penn State's processes will assist them in 
making their own batchloading policies and procedures more efficient. 

In the age of Google, when digital natives expect everything — or almost 
everything — to be discoverable online, libraries face the ever more daunting 
task of providing title-level access to online resources in their catalogs. Providing 
access to large microform and digitized collections for which no or only limited 
(i.e., collection-level) access in the public catalog exists is similarly challenging. 
Batchloading bibliographic records into the catalog is a rapid and cost-effective 
means of meeting these challenges. 

Given its cost-effectiveness and the wide availability of record sets describ- 
ing large collections, batchloading has become a significant workflow for many 
libraries. As more print resources are digitized, more born-digital projects cre- 
ated, and metadata becomes easier to convert and repurpose for bibliographic 
description, Machine-Readable Cataloging (MARC) records for more collections 
are likely to become available. Such record sets can be expensive, but given the 
immense improvement in access to collections they provide compared to a single 
collection-level record, they are often worth the price. 

Some vendors supply MARC records as part of the packages they sell, real- 
izing that libraries may be more likely to purchase or license a resource when they 
know that bibliographic records will ensure that individual titles in the collection 
are discoverable in the catalog. In fact, some institutions, individually or in con- 
cert, may find that lobbying vendors to make records available for every resource 
they sell is advantageous. Use of electronic resources is inextricably linked to 
discoverability, and evidence suggests that title-level records in a library's cata- 
log increase use. At Penn State University Libraries, subject specialists report 
significant increases in use of electronic resources and microforms within days 
(and sometimes within hours) of loading record sets. With each batchloading of 
records, previously hidden collections are made visible, and the vast richness of 
the libraries' holdings is more accurately reflected by the catalog. 

Managing the process of batchloading requires collaboration across several 
library units. Acquisitions staff work with subject specialists and budget officers to 
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negotiate with vendors and purchase 
resources. Collection development 
librarians decide which files to pur- 
chase and set priorities for the order in 
which to load files. Public services staff 
review records to ensure their constit- 
uents' needs are being met. Cataloging 
staff assess record quality, customize 
record sets to meet local needs, and 
coordinate loads. Systems staff load 
records and manage the extraction of 
records for vended authority control. 

Penn State University Libraries 
have devoted substantial financial and 
staff resources in transforming batch- 
loading (originally a small-scale, proj- 
ect-based activity) into a standardized, 
institution-wide workflow. We believe 
that our experiences will be instructive 
to other libraries and that Penn State's 
documentation will assist others in mak- 
ing their own batchloading policies and 
procedures more efficient. This paper 
discusses the management of ad hoc 
batchloading; ongoing regular MARC 
record loads, such as PromptCat or 
Marcive, which at Penn State occur 
on a biweekly or monthly basis and 
are largely automated, fall outside the 
scope of the present discussion. 

Survey of Literature on 
Batchloading Bibliographic 
Records into the Online 
Catalog 

The OCLC began working with librar- 
ies and other vendors in the 1980s 
to promote the shared cataloging of 
microform collections and to provide 
sets of bibliographic records for batch- 
loading purposes. 1 Benefits to catalog- 
ing libraries would be free searching 
and setting of holdings symbols and 
complete sets of the bibliographic 
records that they create or enhance. 
Benefits to other libraries would be the 
ability to acquire entire sets of records 
for discrete collections of microform 
library resources. 

Several projects to catalog 
collections for the OCLC Major 



Microforms effort have been docu- 
mented. Myers described the 
University of Southern Mississippi's 
project to create records for the 
Slavery Pamphlets Collection and 
indicated that a major consideration in 
support of the project was the antici- 
pated high use of the collection after 
title-level access would be available in 
the catalog. 2 Toombs addressed the St. 
Louis University project to catalog the 
Nineteenth-Century Legal Treatises 
Microfiche Collection, noting that 
the project added many unique titles 
to the OCLC catalog. 3 Participation 
by St. Louis University in coopera- 
tive cataloging programs such as the 
Library of Congress Name Authority 
Cooperative Program (NACO) and 
OCLC Enhance has benefited all 
other libraries who use the records 
subsequently. 

Jones described the development 
of microforms cataloging projects to 
create record sets to provide to librar- 
ies as well as efforts at Florida State 
University to batchload records for 
OCLC Major Microforms sets into 
their NOTIS online catalog. 4 He 
reported that OCLC provided record 
customization options for record sets, 
including the addition of a call num- 
ber; however, that feature could be 
improved by increasing the detail 
added to the call number. Nevertheless, 
he found that the addition of records 
to the online catalog greatly increased 
the use of microform resources. Dodd 
described Virginia Tech University's 
experiences with batchloading record 
sets for microform collections into the 
Virginia Tech Library System. 5 She 
described the need for flexibility and 
discussion and highlighted the need 
for cooperation between the cataloging 
unit and the automation department. 
Banerjee reported on Oregon State 
University's experiences batchloading 
records for two major microforms sets 
into their online catalog. 6 He stressed 
the need to analyze record quality 
before loading and suggested limited 
criteria for record review and analysis. 



He also recommended allowing time 
for problem resolution and clean-up 
after the records are loaded. 

Martin described the chal- 
lenges associated with the cataloging 
of eBooks, including the source of 
cataloging records, the potential for 
batchloading, the question of wheth- 
er holdings for print and electronic 
should be on the same record, edits 
that might be needed before record 
loading, ongoing maintenance, and 
adding holdings for eBooks to OCLC. 7 
She also addressed the increased use 
associated with eBooks records' avail- 
ability in online catalogs, citing a num- 
ber of other studies that indicate that 
the cataloging of eBooks increases use 
dramatically, in one case as much as 
755 percent. Many of the issues identi- 
fied and concerns expressed in these 
articles still exist for libraries today, 
whether loading records for microform 
or electronic resources. 

Background of Batchloading 
at Penn State 

In 2001, in response to a large num- 
ber of requests from subject special- 
ists that bibliographic record sets be 
loaded into the online catalog (the 
CAT), Penn State's assistant dean for 
technical and access services con- 
vened a working group charged with 
overseeing the batchloading process 
(see appendix for the change to this 
group). The Bibload Working Group 
(Penn State's integrated library system, 
SirsiDynix's Unicorn, requires the use 
of a report called "bibload" for batch- 
loading bibliographic records into the 
catalog) meets monthly and includes 
representatives from Cataloging and 
Metadata Services, Public Services, 
the Commonwealth Campus Libraries 
(representing twenty-two Penn 
State campuses located throughout 
the state), and the Department for 
Information Technologies. Originally 
chaired by the assistant dean for 
Technical Services, the Bibload Group 
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was subsequently chaired by the head 
of Cataloging and Metadata Services, 
and is now led by the cataloging and 
metadata specialist, whose position 
description was rewritten in 2005 to 
include primary responsibility for man- 
aging the batchloading workflow. The 
responsibilities of the group's mem- 
bers and chair have been documented 
and are made available to potential 
members before they agree to serve so 
that they have a clear understanding 
of what work and time commitment is 
expected of them (four hours per week 
for members, up to thirty-two hours 
per week for the chair). Managing the 
batchloading process requires a solid 
grounding not only in traditional cata- 
loging and the fundamentals of bib- 
liographic description, but also in the 
technical aspects of data management 
and systems analysis. Also essential is 
a grasp of how users search for and 
discover resources in an online and 
increasingly networked environment. 

Since 2001, the group has over- 
seen the loading of more than half a 
million records into the CAT. Given 
that Technical Services at Penn State 
manually adds between fifty thou- 
sand and sixty thousand records to the 
catalog in an average year, batchload- 
ing, measured in terms of quantity, 
has doubled the productivity of the 
Technical Services Division. Fourteen 
percent of the records in the online 
catalog were batchloaded since 2001. 

Policy Issues 

In the development of any new work- 
flow, libraries encounter issues that 
may require extensive discussion 
resulting in policy decisions. Those 
decisions that affect access, the quality 
of the database, or workflow that cross- 
es organizational boundaries require 
broad input and are best made with 
consensus. The batchloading workflow 
has been no exception, and a num- 
ber of questions have arisen during 
the development of this workflow at 



Penn State. They include issues such 
as record quality versus access; single 
versus multiple records for materials 
held in print, microform, or electronic 
formats; what protocols or standards 
will be established to record deci- 
sions; which level of staff can do what 
work; whether the records should be 
purchased or simply downloaded from 
OCLC; and who will make these and 
related decisions. 

Record Quality versus Access 

Balancing record quality and improve- 
ment to access remains one of the 
biggest challenges in the batchloading 
process. Ideally, all records loaded into 
the catalog should conform fully to 
national and local standards. In prac- 
tice, this is impossible. Few records 
sets are perfect and, in cases where 
the records are felt to be substandard 
in ways that might seriously affect the 
library's services or workflows, a deci- 
sion must be reached about whether 
to load the files and, if so, how much 
record modification should occur prior 
to loading. 

Also in question is the complete- 
ness of some record sets. Banerjee 
noted in 2001 that a record set pur- 
chased from the OCLC appeared to be 
missing "as many as 500 records — over 
eight percent of the entire collection" 
and Penn State recently encountered 
a similar situation. 8 Such experiences 
demonstrate that loading large record 
sets cannot ensure accurate coverage 
of collections to the same extent that 
on-site, title -by-title cataloging can. In 
some cases missing records likely go 
unnoticed for years, meaning that col- 
lections thought to be fully described 
in the catalog are not. Without com- 
mitting resources to painstaking and 
time-consuming post-load qual- 
ity checks, avoiding such oversights is 
nearly impossible. 

Penn State's policy is to favor access 
over record quality. If the "greater 
good" is served by loading the records 
into the online catalog, then they are 



loaded. However, as will be described 
later, much effort goes into improv- 
ing the records through the use of 
MarcEdit software. Penn State's policy 
is to consult subject specialists during 
the decision to load the records and 
during the record enhancement stage. 

Format Duplication, 
Multiple versus Single Records 

The practice of maintaining a single 
bibliographic record for multiple ver- 
sions of a given resource is common, 
even though such practice has, at 
various times, conflicted with national 
cataloging standards. Under such a 
policy, often grounded in a library's 
belief that users prefer to see hold- 
ings in multiple formats on the same 
record, a single catalog record might 
describe not only a printed book, but 
the microform reproduction and a 
digital version available online. 

Both batchloading and the avail- 
ability of many e-resources from mul- 
tiple sources have made this policy 
increasingly difficult to justify or main- 
tain. While standard numerical fields 
in bibliographic records such as the 
ISBN, ISSN, or Library of Congress 
classification number allow a certain 
degree of record matching, in the 
absence of unique and universally 
recognized record identifiers, most 
integrated library systems are simply 
unable to prevent duplication with 100 
percent efficiency. Because effective 
de-duplication is not feasible, loading 
multiple records for different versions 
of a resource and sometimes for the 
same resource supplied by different 
vendors becomes necessary. In addi- 
tion, the relatively recent availability of 
e-journal link resolver services such as 
ExLibris's SFX, many of which require 
the monthly loading of records that 
duplicate records already in a library's 
catalog, has made record duplication 
commonplace. 

On a positive note, keeping each 
load separate facilitates the batch 
removal of items should the library 
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cease subscription to a given collec- 
tion. It also makes possible setting 
better and more accurate holdings in 
the OCLC, thus facilitating the inter- 
library loan process and potentially 
setting the stage for network-level 
resource discovery services, such as 
WorldCat Local. 

Record Keeping and 
Documentation of Practices 

The batchloading process is inher- 
ently complex, involving staff from 
throughout the organization and 
sizable amounts of technical data. 
Detailed record keeping is essential, 
both as a means of keeping stake- 
holders informed and of documenting 
practices so that complex procedures 
and solutions need not be devised and 
reformulated repeatedly. Such record 
keeping will improve the chances for 
success of a process that is so heavily 
distributed throughout the organiza- 
tion. The Bibload Group's website 
(www.libraries.psu.edu/tas/cataloging/ 
dept/bibloads/bibload.htm) describes 
the group's charge, lists group mem- 
bers, and provides links to documen- 
tation. Detailed minutes of monthly 
Bibload Group meetings are taken by 
the chair, circulated for comment and 
correction, and then posted to the 
page. Technical details about each 
load, such as file size, are included, as 
are text versions of each file as well as 
the raw MARC files. Comprehensive 
records of report load specifications 
and load reports generated by the sys- 
tem (which include error logs) for all 
test and production loads accompany 
each file. Finally, Microsoft Word doc- 
uments outlining the analysis of each 
loaded file along with changes made to 
the files prior to load are archived on 
the same page. 

Staffing Levels 

Experience at Penn State quickly 
demonstrated that management of the 
batchloading workflow was best done 



by a central group, with one person 
responsible for coordinating the many 
pieces of the puzzle. Excellent project 
management skills, the ability to follow 
through, and a high level of diplomacy 
are necessary to coordinate a fairly 
complicated workflow that has many 
stakeholders with competing priori- 
ties. Because this activity has become 
such a large and ongoing responsibility 
and includes providing direction to 
both librarians and staff throughout 
the libraries, a high-level professional 
staff position was created from an 
already existing position and given the 
responsibility for managing and coor- 
dinating the entire workflow. 

Batchloading has also resulted in a 
significant amount of post-load work, 
including the correction of records 
that did not load appropriately, cata- 
loging of titles that were missing from 
the files or simply did not load, and 
authorities cleanup. Much of this work 
can be assigned to a lower-level staff 
member in Cataloging and Metadata 
Services, but since the problems 
resulting from different batchloading 
projects can vary from one project to 
another, they generally require some 
direction from the Bibload manager. 
As each load is completed, the cleanup 
required is identified by the manager, 
who drafts procedures to help the staff 
member assigned to make the correc- 
tions. Cataloging knowledge is useful 
for resolving many of the problems 
encountered, so post-load projects are 
usually assigned to an experienced 
copy cataloger. 

Purchasing Record Sets versus 
Downloading from the OCLC 

In some cases the question of whether 
to purchase records as a set from a 
vendor or to download on a title-by- 
title basis from the OCLC is a simple 
one. If the records are provided as a 
proprietary service from a vendor, they 
may not be available in the OCLC; in 
such cases, the only way to provide 
access to those materials is to acquire 



the records from the vendor. If the set 
of records is so large as to be unwieldy 
or impossible to handle on a title-by- 
title basis, the decision to purchase 
as a set is similarly obvious. At Penn 
State, this cutoff point is set at one 
hundred records. If a collection has 
more than one hundred titles and 
records available, we will purchase the 
records as long as funds are available 
to do so. We have found that batch- 
loading projects involving fewer than 
one hundred titles — which, like larger 
loads, still require group input, test 
loads, and systems office resources — 
are not worth pursuing through the 
normal batchloading process. In these 
cases, assuming records are available 
in the OCLC, we have chosen to 
catalog titles individually rather than 
batchloading the records. 

Making Decisions and Getting Input 
from the Right People 

Because anyone who consults a 
library's catalog is potentially affected 
by batchloading, identifying and com- 
municating with stakeholders is criti- 
cal. At Penn State, the Bibload Group 
includes two members from public 
service units, but they cannot, nor are 
they expected to, speak for all of their 
colleagues. Large records sets have 
been loaded for materials in many dif- 
ferent disciplines, including engineer- 
ing, social sciences, statistical data, 
history, literature, medicine, and law. 
Interested parties in the libraries are 
invited to review records and to pro- 
vide input at each step of the process 
for any given load. In especially sig- 
nificant loads, Penn State's Collection 
Development Council, charged with 
coordinating acquisition of materials 
for the libraries, may be consulted. 
Batchloading cannot meet everyone's 
needs perfectly, but broadening the 
pool from which feedback is solicited 
both lessens the possibility of errors 
and heightens awareness of the impor- 
tance of batchloading throughout the 
organization. It is the Bibload Group's 



53(1) LRTS 



Using Batchloading to Improve Access to Electronic and Microform Collections 57 



policy to seek and consider input from 
all stakeholders; this policy is codi- 
fied in procedural documents that the 
group follows for each batchloading 
project. 

Workflow 

The batchloading workflow can vary 
from project to project. This section 
describes the typical workflow of a 
batchloading project, providing exam- 
ples from Penn State's experiences. 

Identification of Available Files 

While the OCLC has, for many years, 
offered MARC records for elec- 
tronic and microform sets through 
its WorldCat Collection Sets service 
(www.oclc.org/worldcatsets/default 
.htm), an increasing number of ven- 
dors of electronic and microform col- 
lections are making MARC record 
sets available for the collections they 
sell. Records are also available from 
commercial cataloging firms such as 
Cassidy Cataloguing Services, based 
in Rockaway, New Jersey, which 
sells packages of Westlaw, Lexis, and 
HeinOnline records targeted at law 
libraries. A fundamental challenge 
of batchloading records therefore is 
keeping abreast of record availabil- 
ity. Subject selectors may not be in 
the habit of querying vendors about 
record sets, and records may become 
available for collections acquired many 
years earlier. The Bibload Group at 
Penn State has taken an increasingly 
proactive role in researching record 
availability both by encouraging selec- 
tors to consider record availability as 
an important aspect of any new pur- 
chase and by researching record avail- 
ability for sets the libraries already 
own or license. 

A batchloading project begins 
when either the Bibload Group or a 
subject specialist becomes aware of the 
availability of records for a collection 
that either has already been purchased 



or for which purchase is pending. 
Before the advent of online databases, 
most such sets acquired described 
microform collections that the librar- 
ies already owned but for which only 
a single collection-level record was 
available in the catalog. More recently, 
most of the sets acquired describe the 
titles constituting electronic aggregate 
resources. 

Acquisition of Files 

Some files are made freely available 
on a vendor's website. Other files, 
while free, must be requested, and the 
vendor may make them available via 
either a website or FTP, or send them 
as e-mail attachments. 

Purchasing sets of bibliographic 
records can be more complex, and 
Penn State has adopted two different 
models for the process. In some cases, 
Cataloging and Metadata Services 
allocate funds for the purchase, are 
invoiced directly, and must submit a 
purchase order through the libraries' 
Business Office. (Depending on the 
cost of the file, approval for the pur- 
chase from a single source may have 
to be secured from the university's 
Department of Purchases, a step that 
may delay the project and must be 
taken into account during the plan- 
ning phase.) In other cases, record 
sets are purchased with the collections 
fund; such purchases are initiated by 
staff in the Serials and Acquisitions 
Department exactly like purchases of 
items for the collection. 

Some vendors offer to modify 
records to suit local needs. For exam- 
ple, the American Antiquarian Society, 
which provides records for Early 
American Imprints, First Series, allows 
purchasers to select records for a par- 
ticular version (microopaque, positive 
microfiche, or negative microfiche), 
select which MARC field to use for 
the call number (090, 099, or other), 
and indicate what the base call num- 
ber should be. The OCLC provides 
a number of options for modifying 



record sets for both electronic and 
microform collections, including edit- 
ing 856 fields (used for access informa- 
tion for electronic resources), deleting 
fields on the basis of their MARC tag, 
adding call number fields, customizing 
call numbers by pulling information 
from more than one source (such as 
a series number), adding fields, and 
more. With the advent of the MarcEdit 
software (discussed later), Penn State 
performs all customizations on site 
rather than asking vendors to modify 
records prior to purchase. 

Acquisition of files has implica- 
tions for workflow, staffing, server stor- 
age space, and network security. File 
naming conventions must be adopted. 
Server space must be designated and 
permissions assigned to appropriate 
staff. Copies of files must be routinely 
created and stored in a location acces- 
sible to staff charged with manipulat- 
ing and loading files. 

Record Review and Evaluation 

Whether purchased from the OCLC, 
supplied by a vendor, or acquired 
from a third-party source, bibliograph- 
ic records intended for batchloading 
must be reviewed for quality. A pre- 
liminary check by the batchloading 
process manager determines whether 
the correct number of records has 
been delivered, whether the records 
describe the correct set of resources, 
and whether the records are in the for- 
mat agreed upon (usually USMARC 
21 using either MARC-8 or UTF-8 
encoding). Discrepancies are reported 
promptly to the supplier and arrange- 
ments made for a new file to be pro- 
vided. 

Software can be useful to deter- 
mine quickly whether a file meets 
validation rales, but human review by 
experienced catalogers and systems 
staff is considered essential. To facili- 
tate such review, a file is converted 
from MARC to text format and made 
available to members of the Bibload 
Group and other stakeholders. All 
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group members are expected to review 
a given number of records (at Penn 
State, twenty- five) within an agreed- 
upon time frame (e.g., five work- 
ing days) to determine whether the 
records meet local needs. After records 
are deemed acceptable by cataloging 
and systems staff, subject specialists 
may identify modifications intended 
to improve their usefulness to patrons, 
such as notes, links to online guides, or 
series fields. Using input from subject 
specialists and members of the group, 
the records are edited and prepared 
for load using a freeware software pro- 
gram called MarcEdit (http://oregon 
state . edu/~reeset/ marcedit/html/index 
.php) developed by Terry Reese. 

Record Modification 

All record sets require some modifica- 
tion before being loaded into the cata- 
log. For the Unicorn integrated library 
system at Penn State at least a 949 
field (containing the call number, clas- 
sification scheme, purchasing library, 
home location, item type, and flags to 
indicate circulation and permanence) 
must be added to each record. These 
elements are required by the CAT; if 
not supplied during batchloading, the 
information would have to be manual- 
ly added to each record after the load. 

Many sets require additional 
modification. Local notes are added to 
records for online resources to inform 
patrons that access to the resource is 
restricted to Penn State users. The 
address of the libraries' proxy server is 
pre-pended to URLs so that off-cam- 
pus users can authenticate to reach 
licensed products. Additional series 
statements may be added to assist 
in the retrieval of records using a 
single search. Links to guides available 
online may be added. In some cases, 
substandard record quality may neces- 
sitate corrections or modifications, 
such as converting 650 fields with 
indicators 14 (subject headings drawn 
from a local, usually nonstandard, the- 
saurus instead of from the Library of 



Congress Subject Headings) to 653 
uncontrolled keyword fields or batch 
correcting typographical errors. The 
Program for Cooperative Cataloging 
(PCC) Standing Committee on 
Automation has created a guide for 
use by vendors when creating sets of 
bibliographic records to accompany 
monograph aggregations. 9 In theory, 
this guide should help vendors and 
publishers create future products that 
are tailored to meet the needs of 
libraries. While our discussion with 
one vendor indicates some interest 
in conforming to national cataloging 
standards, our experience suggests 
that vendors may be slow to adopt 
practices that fully conform to current 
library standards for quality. 

Modifying Records Using MarcEdit 

MarcEdit has revolutionized the ways 
libraries can manage their MARC 
records. Until recently, libraries were 
dependent on local programmers or 
systems staff to modify large record 
sets. MarcEdit empowers library staff 
to do the work themselves quickly 
and effectively by providing a wide 
array of tools for manipulating files of 
MARC records: Fields may be added 
or deleted, global edits made, and data 
swapped from one field to another. 10 
In addition, MarcEdit's implementa- 
tion of regular expressions — known in 
the computing world as regexes, a con- 
cise and flexible means for identifying 
strings of text of interest, such as par- 
ticular characters, words, or patterns of 
characters — allows more sophisticated 
manipulation of data, such as building 
call numbers from data in multiple 
fields or selectively removing fields 
when certain data elements are pres- 
ent. Editing files locally is generally 
more flexible and more cost effective 
than requesting record customization 
from vendors. 

Developing Load Specifications 

The SirsiDynix Unicorn integrated 



library system allows several options 
regarding the batchloading of bib- 
liographic records. Of primary impor- 
tance is specifying how the unique 
record-specific identifier (title control 
number) is to be built during the 
load: from a numerical field in each 
record (e.g., 001, 020, 035) or sim- 
ply system-generated. The presence 
of unique record-specific identifiers 
is essential in allowing subsequent 
updating or overwriting of records. 
Also configurable is the load rule, 
which determines how new and dupli- 
cate records are handled. Finally, sev- 
eral parameters are set to specify how 
call numbers and copy information is 
generated during the load. 

Test Loads and Evaluation 

Before being loaded into the produc- 
tion catalog, each file is first load- 
ed onto the libraries' test server for 
review. Experience has shown that 
subject specialists and public servic- 
es librarians are more comfortable 
reviewing records in the CAT than 
as simple text files and that potential 
problems not readily apparent based 
on inspection of the MARC records in 
isolation often become obvious in the 
context of the catalog. Furthermore, 
a test load is crucial for verifying that 
call number, library, location, and cir- 
culation status data has been config- 
ured and loaded correctly. Finally, a 
test load also serves to determine how 
many, if any, records will be returned 
as duplicates and to evaluate what 
action should be taken to address such 
duplication. 

After the file is loaded into the 
test server, an e-mail message is sent 
to the Bibload Group and other stake- 
holders informing them of the avail- 
ability of the records for review in 
the test CAT. The message includes 
information about the size of the file, 
the number of error records (i.e., 
records returned as having failed to 
load), and instructions for retrieving 
the records in the catalog. Bibload 
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Group members and other interested 
parties are requested to review the 
records within five working days and 
to send comments or questions to the 
group. 

Production Load 

If, following the test load, stakeholders 
voice concerns that require modifica- 
tions to the records, a second test 
load may be undertaken to address 
the concerns raised. After approval 
of the final test load, files are loaded 
into production using the same report 
specifications as the test load. 

An e-mail message is sent to the 
Bibload Group and other stakehold- 
ers informing them of the availability 
of the records for review in the CAT. 
Although in principle the production 
load should have results identical to 
the approved final test load, this review 
of the production load is undertaken 
by the Bibload Group and stakehold- 
ers in the interest of quality control to 
ensure that no unanticipated effects 
have occurred. 

Off-Campus Access 

Access to purchased electronic 
resources is almost always limited 
to users affiliated with the purchas- 
ing institution. Many vendors use IP 
filtering to manage access, so, for 
example, authorized Penn State users 
attempting to access content from off 
campus (i.e., from non-Penn State IP 
addresses) find themselves blocked. To 
ensure access to all authorized Penn 
State users regardless of their physical 
location, the Bibload Working Group 
began modifying vendor- supplied 
URLs by pre-pending the address for 
the libraries' proxy server. On-campus 
users who click on the link are taken 
seamlessly to the resource, while 
off-campus users, if they have not 
already authenticated as PSU users, 
are required to log in with their Penn 
State access accounts, and are then 
passed through to the resource. 



Promotion 

Making the libraries' community 
aware of the newly loaded records is 
seen as a critical step in the batch- 
loading process. When the Bibload 
Working Group was first formed, little 
or no promotion was undertaken. The 
subject specialist most closely inter- 
ested in the load was informed that the 
records were available in the CAT, but 
no formal announcement was made to 
the libraries or the campus as a whole. 
Subject specialists were expected to 
make their constituents aware of the 
newly loaded records. 

In an effort to educate colleagues 
about the progress made in provid- 
ing access to hitherto hidden col- 
lections and to promote the work 
of the Bibload Group, global e-mail 
announcements are now sent to the 
entire Penn State Libraries communi- 
ty following each significant load. The 
announcements, drafted by the chair 
of the group in collaboration with 
the subject specialist, include a brief 
description of the collection's scope 
and importance as well as instructions 
for retrieving the records in the CAT. 
Such announcements not only provide 
information that allows the libraries' 
staff to provide better service to users, 
they also heighten awareness of the 
importance of batchloading and give 
credit to the members of the Bibload 
Working Group. 

Vendor-Supplied Authority Control 

Like many large academic libraries, 
Penn State sends records to an exter- 
nal vendor for authority control on 
a monthly basis. Large batchloading 
projects, especially those likely to cre- 
ate a sizable number of unmatched 
headings, are reported to the authori- 
ties librarian before the load takes 
place. In cases where series headings 
are added to files for the purpose of 
retrieval, series authority records are 
established in the Library of Congress 
Authority File (LCAF) prior to the 



production load of the file to ensure 
that records containing the new 
series are not returned as part of the 
unmatched headings report. 

Managing Catalog Extracts 

Many large record sets purchased from 
vendors may not, because of contrac- 
tual obligations, be supplied to the 
OCLC as part of the libraries' monthly 
holdings load. As a result, any ineli- 
gible records must be removed from 
the file before it is supplied to the 
OCLC. A file of unique record identifi- 
ers is generated and archived for every 
file that is batchloaded at Penn State. 
These files are used by systems staff to 
remove ineligible records prior to send- 
ing extract files to the OCLC and can 
also serve as a means for batch deleting 
large record sets in cases where the 
libraries cancel access to e-resources 
and must therefore remove records 
from the catalog. At Penn State the 
need to batch delete a batchloaded 
file has not yet arisen, but a similar 
procedure is used monthly to remove 
and then reload updated versions of Ex 
Libris's SFX records. 

Post-Load Cleanup 

Although one or more test loads can 
minimize errors, given the size and 
scope of most batchloading projects, 
which often involve tens of thousands 
of records, some post-load manual 
cleanup is inevitable. Records may fail 
to load, call numbers may load incor- 
rectly, and the bibliographic records 
may have problems that are difficult or 
impossible to correct using MarcEdit. 
During the test phase the Bibload 
Group, in consultation with stakehold- 
ers, may decide that a certain percent- 
age of errors is acceptable if correcting 
them after the load is easier or quicker 
than repeatedly modifying load specifi- 
cations. When such a decision is made, 
a document is drafted by the Bibload 
Group chair outlining the nature and 
extent of the anticipated cleanup 
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required. Depending on the resources 
required, one or more staff may be 
assigned to work on the project. 

Exposure to Risk and URL 
Management 

Unlike physical collections, e-resources 
are often hosted remotely on ven- 
dor or third-party servers over which 
libraries have no control. When these 
servers fail or when URLs change, 
large numbers of e-resources sud- 
denly may become inaccessible. The 
presence of title-level records in the 
online catalog heightens the effect 
of such technological glitches. Two 
approaches for managing such risk are 
routinely checking URLs and creat- 
ing backup copies of remotely hosted 
resources. Link-checking software, 
while useful for systematically verify- 
ing that URLs in the library catalog 
are functioning properly, usually gen- 
erates reports that library staff must 
review and process manually — a time- 
consuming procedure. Some vendors, 
such as Gale/Cengage Learning, sup- 
ply archival copies in XML format 
of digital content to libraries so that, 
in the event that the vendor's server 
becomes inaccessible, client librar- 
ies will be able to ensure access to 
the content from their own servers. 
Although this approach is sound in 
theory, it requires libraries to create 
and maintain a server infrastructure 
capable of providing seamless access 
to e-resources normally hosted off site. 
For many libraries, such a strategy may 
be impractical. Penn State has begun 
preliminary discussions for managing 
archival content on local servers but 
has not yet implemented any policies 
or procedures for doing so. 

Managing Ongoing Loads 

Some batchloaded files must be sup- 
plemented by updates. NetLibrary, 
for example, regularly adds titles to 
its collection, as does the American 
Council of Learned Societies (ACLS) 



Humanities E-Book Project. In other 
cases, vendors do not supply update 
files but instead provide new releases 
of entire record sets. In either sce- 
nario, provisions must be made for 
regularly acquiring and loading files 
and for ensuring that duplication is 
avoided. Managing ongoing loads can 
be especially challenging when ven- 
dors release updates irregularly, when 
updates are so small as to render the 
batchloading process less than ideally 
efficient, and when record quality is 
inconsistent, as was recently the case 
for the ACLS Humanities E-Book 
Project. Early batches of records treat- 
ed the project name (History E-Book 
Project) as a series statement, while 
subsequent installments treated the 
project name as a corporate body 
(History E-Book Project, which later 
became the ACLS Humanities E-Book 
(Organization)). Files had to be edited 
to remove the inconsistency. 

What the Future Holds 

The biggest challenges of managing 
batchloading projects are techno- 
logical and organizational. Validating 
large record sets, de -duplicating files 
to prevent duplicate records in the 
catalog, verifying that URLs function 
as intended, and ensuring seamless 
access to remotely hosted content in 
the event of server outages or other 
technological failures depend on soft- 
ware and hardware that continuously 
must be updated and maintained. 
MarcEdit, perhaps the most powerful 
software tool in the batchload toolkit, 
is in continuous development. Future 
users of the software may have access 
to even more powerful tools for vali- 
dating, editing, and converting biblio- 
graphic data. 

What effect the implemen- 
tation of the entity-relationship 
model of metadata recommended in 
IFLAs Functional Requirements for 
Bibliographic Records and its applica- 
tion through Resource Description 



and Access (the successor to the 
Anglo-American Cataloguing Rules) 
will have on catalog records and on the 
structure of the catalogs themselves 
remains to be seen. 11 Batchloading, 
which is largely based on the single 
flat record concept underlying current 
cataloging standards, will necessarily 
evolve as bibliographic databases are 
reconceptualized and restructured to 
better reflect the current landscape of 
information discovery and retrieval. 

Because batchloading requires 
expertise in a broad array of library 
areas (acquisitions, cataloging, systems 
administration, public service), staff 
skills must evolve to meet this chal- 
lenge. Cross-training, efficient mod- 
els of communication, and up-to-date, 
concise, accessible documentation of 
policies and procedures will all be 
essential elements of the batchloading 
workflow of the future. 



Conclusions 

Batchloading is a complex process, 
both technologically and organiza- 
tionally, requiring the coordination of 
resources from throughout a library. 
The experiences and processes devel- 
oped at Penn State can help other 
institutions make more informed deci- 
sions and devise policies and proce- 
dures most likely to ensure a successful 
batchloading workflow. 

Given the number of variables 
and the rapidly changing technologi- 
cal landscape, no single batchloading 
project fully exemplifies the process. 
Each load is different, requiring that 
all stakeholders be responsive to 
new opportunities and new challeng- 
es. Large gains in efficiency can be 
achieved by standardizing workflows 
and by carefully documenting proce- 
dures, but the process must be flexible 
enough to accommodate variations in 
the parameters, such as the size and 
quality of record sets, their cost, the 
likelihood that access to resources will 
become available through channels 
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other than the library catalog, and rap- 
idly changing user expectations. 

The goal of batchloading is 
improved access to the libraries' collec- 
tions. Every item or resource to which 
the libraries provide access should be 
represented in the catalog. Loading 
large bibliographic files is an especially 
effective means of working toward this 
goal, and is much more efficient than 
traditional piece-by-piece cataloging. 

Batchloading also allows improv- 
ing the granularity of the catalog. 
Traditionally, online catalogs have 
described a library's holdings at the 
item level (for books and monograph- 
like items in other formats) or at 
the collection level (for large micro- 
form collections, electronic resource 
aggregator databases, serial publica- 
tions, and archives and manuscript 
collections). As user expectations 
change and full-text databases become 
increasingly common, batchload- 
ing allows for greater granularity — 
providing title-level access for collec- 
tions for which only collection-level 
access was available previously and 
providing analytical access to items 
for which only title-level access was 
available. Batchloading improves what 



might be called the resolution of the 
catalog. Once a magnifying glass that 
allowed users to see a certain level 
of detail of the collections, the cata- 
log can be transformed over time 
into a powerful microscope allowing 
a more magnified and therefore more 
detailed examination of an institution's 
rich collections. 
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Appendix. Bibload Working Group Charge 

To manage the purchase, testing, and loading of sets of bibliographic records. Tasks will include: 

• Confirm funding source. 

• Complete record profile and deliver order to acquisitions staff or Business Office, as appropriate. 

• Upon delivery, review record quality. 

• Seek input from subject specialists regarding call number or other desirable edits to the bibliographic records. 

• Customize records to suit subject specialists' needs. 

• Prepare load specifications, consulting with subject specialists or library heads as appropriate. 

• Bun bibload report in test/development catalog, repeating as necessary. 

• Work with Digital Library Technologies staff to run bibload report in production catalog. 

• Inform the library community about availability of the records in the CAT. 
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Book Review 

Edward Swanson 



E-Journal Invasion: A Cataloger's 
Guide to Survival. By Helen 
Heinrich. Oxford: Chandos Pub., 
2007. 260p. £39.95 paper (ISBN 
1-84334-144-1); £59.95 hardcover 
(ISBN 1-84334-193-X). 

Written from a cataloging practi- 
tioner's point of view and set against 
the backdrop of rapid changes in jour- 
nals publishing, this book examines the 
changes in cataloging theory and prac- 
tice that have ensued from the rapid 
proliferation of electronic journals and 
aggregator databases. Heinrich states 
that the book is intended principally 
for cataloging managers who have the 
responsibility for developing the cata- 
loging policies and procedures that will 
define the resources discovery routes 
that govern how a library's users will 
gain access to electronic journals. It is 
also aimed at cataloging practitioners. 

The opening chapter deals with 
the effect of the Internet on the 
work of catalogers, starting with the 
MABC format familiar to most cata- 
logers and charting the development 
of the basic concepts within MABC 
that were originally established to 
describe physical printed works. 
From this familiar territory, Heinrich 
moves on to describe emerging meta- 
data schema, including MABCXML, 
Metadata Object Description Schema 
(MODS), and Dublin Core. She helps 
those new to such concepts to improve 
their understanding through the use 
of illustrations, numerous tables, and 
examples that outline both the data 
entry elements incorporated in each 
schema and the public views of dif- 
ferent record types taken from online 
library catalogs and the Internet. 

From this basic introduction to 
the problems that libraries have been 
facing and the development of new 
metadata schema, the author goes 



on to describe the changes made in 
the Anglo-American Cataloguing 
Rules, 2nd ed., Machine-Beadable 
Cataloging (MABC), and Cooperative 
Online Serials (CONSEB) rules, 
focusing on the revisions required 
in each standard to provide biblio- 
graphic control for remote electronic 
resources. Having completed this use- 
ful background information, Heinrich 
starts to address some of the real 
issues and decisions facing librar- 
ies, including the central problems 
of whether to adopt a user-friendly 
single cataloging record approach, 
which keeps all data on both the print 
and electronic versions of a journal 
within one catalog record, or to go for 
the administratively easier separate- 
records approach. The role of aggrega- 
tors or electronic journal package pro- 
viders is discussed at length, and the 
effect of the ever-changing journals 
market, with the consequent virtual 
impossibility of libraries being able to 
keep up with in-house catalog record 
creation and maintenance overheads, 
will be familiar to many journals cata- 
loging practitioners. 

As libraries have struggled to keep 
up with cataloging and record main- 
tenance tasks, a new broad market 
and increasing demand for commer- 
cial MABC services has developed. In 
examining a variety of vendors' MABC 
products, the author discusses many of 
the dilemmas encountered by librar- 
ies when considering acquisitions of 
such services. Heinrich describes and 
analyzes the complexities of incorpo- 
rating commercially produced catalog 
records into local library databases and 
offers some practical solutions to many 
of the most common questions and 
issues that libraries would face. 

In chapter 4 the author brings 
together all the theoretical and 



historical strands of the emergence of 
the Internet, new cataloging rules, and 
issues, and tries to put them in a prac- 
tical perspective by describing how 
all of these issues have been locally 
addressed at her library at California 
State University, Northridge. The uni- 
versity library's step-by-step imple- 
mentation of a commercial MABC 
record service is described in an 
attempt to "help libraries avoid feel- 
ing blindfolded during the course of 
implementation and post implementa- 
tion maintenance" (127). The informa- 
tion in this section is also backed up 
with quotations and references from 
other libraries across North America 
and Europe, giving readers easy access 
to journal articles that have addressed 
many of the key themes of single 
versus separate online public access 
catalog records, e-resource cataloging 
practice, and the effects of e-journal 
management tools and services on 
serials cataloging. The book concludes 
with a look into the future of catalog- 
ing generally, citing the "noticeable 
shift from 'deep' quality cataloguing 
to 'light' cataloging" (195) and the 
move to the supersizing of catalog- 
ing as libraries have graduated from 
single record downloading to bulk 
ingesting of files with hundreds or 
thousands of records. New develop- 
ments such as metasearching or fed- 
erated searching, the emergence of 
the Open UBL standard, the use of 
Digital Object Identifier (DOI)-based 
linking, and open access initiatives 
are briefly described, continuing the 
theme of providing basic introductions 
and descriptions to key themes and 
developments. 

Although from the outset Heinrich 
states that the book is intended for 
cataloging practitioners and cataloging 
managers, she also acknowledges that 
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it may be useful to vendors and com- 
mercial suppliers that provide online 
journal services to libraries. To this list 
could also be added serials librarians, 
for whom responsibility for cataloging 
journals may be a new and daunt- 
ing responsibility with a minefield of 
acronyms, issues, and standards that 
need to be safely crossed, or simply as 
background for understanding the dif- 
ficulties faced by their cataloging col- 
leagues in describing and facilitating 
access to the collections they manage 



and administer. 

Although numerous journal arti- 
cles have been published addressing 
many of the issues raised and discussed 
in the book, this work is unique in its 
attempt to chart the historical context 
of developments in this field of librari- 
anship and put them in perspective for 
those facing the challenges of handling 
and managing electronic resources 
today. Heinrich's book is not only very 
readable as a complete work but can 
also be used as a quick reference guide 



for those wanting to look up specific 
terms and acronyms or read a case 
study of an actual implementation of 
an e-journals cataloging service from 
an external vendor. The work provides 
a very useful comprehensive overview 
of all the issues and developments and 
as acts as a one-stop shop for those 
wishing to gain a better understanding 
of the complexities of current-day seri- 
als cataloging. — Helen Adey, (helen 
.adey@ntu.ac.uk), Nottingham Trent 
University, Nottingham, England. 
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